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DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND 
THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS 

The present application is a continuation-in-part of U.S. Patent 
Application Serial No. 09/235,245 filed January 22, 1999, which claims benefit of 
U.S. Provisional Patent Application Serial No. 60/093,727 filed July 22, 1998, and 
U.S. Provisional Patent Application Serial No. 60/074,522 filed January 22, 1998, all 
of which are hereby incorporated by reference. The present application also claims 
benefit of U.S. Provisional Patent Application Serial No. 60/146,178 filed July 29, 
1999, which is hereby incorporated by reference. 

The present invention was made with funding from National Institutes 
of Health Grant No. GM38839. The United States Government may have certain 
rights in this invention. 

FIELD OF THE INVENTION 

This invention relates to genes and proteins that replicate the 
chromosome of Gram positive bacteria. These proteins can be used in sequencing, 
amplification of DNA, and in drug discovery to screen large libraries of chemicals for 
identification of compounds with antibiotic activity. 

BACKGROUND OF THE INVENTION 

All forms of life must duplicate the genetic material to propagate the 
species. The process by which the DNA in a chromosome is duplicated is called 
replication. The replication process is performed by numerous proteins that 
coordinate their actions to duplicate the DNA smoothly. The main protein actors are 
as follows (reviewed in Kornberg et al., DNA Replication , Second Edition, New 
York: W.H. Freeman and Company, pp. 165-194 (1992)). A helicase uses the energy 
of ATP hydrolysis to unwind the two DNA strands of the double helix. Two copies of 
the DNA polymerase use each "daughter" strand as a template to convert them into 
two new duplexes. The DNA polymerase acts by polymerizing the four monomer unit 
building blocks of DNA (the 4 dNTPs, or deoxynucleoside triphosphates are: dATP, 
dCTP, dGTP, dTTP). The polymerase rides along one strand of DNA using it as a 
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template that dictates the sequence in which the monomer blocks are to be 
polymerized. Sometimes the DNA polymerase makes a mistake and includes an 
incorrect nucleotide (e.g., A instead of G). A proofreading exonuclease examines the 
polymer as it is made and excises building blocks that have been improperly inserted 
5 in the polymer. 

Duplex DNA is composed of two strands that are oriented antiparallel 
to one another, one being oriented 3 '-5' and the other 5' to 3'. As the helicase 
unwinds the duplex, the DNA polymerase moves continuously forward with the 
helicase on one strand (called the leading strand). However, due to the fact that DNA 

10 polymerases can only extend the DNA forward from a 3' terminus, the polymerase on 

the other strand extends DNA in the opposite direction of DNA unwinding (called the 
lagging strand). This necessitates a discontinuous ratcheting motion on the lagging 
strand in which the DNA is made as a series of Okazaki fragments. DNA 
polymerases cannot initiate DNA synthesis de «ovo,.but require a primed site (i.e., a 

15 short duplex region). This job is fulfilled by pnmase, a specialized RNA polymerase, 

that synthesizes short RNA primers on the lagging strand. The primed sites are 
extended by DNA polymerase. A single-stranded DNA binding protein ("SSB") is 
also needed; it operates on the lagging strand. The function of SSB is to coat single 
stranded DNA ("ssDNA"), thereby melting short hairpin duplexes that would 

20 otherwise impede DNA synthesis by DNA polymerase. 

The replication process is best understood for the Gram negative 
bacterium Escherichia coli and its bacteriophages T4 and T7 (reviewed in Kelman et 
ah, "DNA Polymerase III Holoenzyme: Structure and Function of Chromosomal 
Replicating Machine," Annu. Rev. Biochem. , 64:171-200 (1995); Marians, K.J., 

25 "Prokaryotic DNA Replication," Annu. Rev. Biochem. , 6 1 :673-7 1 9 (1 992); McHenry, 

C.S., "DNA Polymerase III Holoenzyme: Components, Structure, and Mechanism of 
a True Replicative Complex," J. Bio. Chem. , 266.19127-19130 (1991); Young et al., 
"Structure and Function of the Bacteriophage T4 DNA Polymerase Holoenzyme," 
Am. Chem. Soc , 31 :8675-8690 (1992)). The eukaryotic systems of yeast 

30 (Saccharomyces cerevisae) (Morrison et al., "A Third Essential DNA Polymerase in 

S. cerevisiae," Cell , 62:1 143-51 (1990) and humans (Bambara et al., "Reconstitution 
of Mammalian DNA Replication," Prog. Nuc. Acid Res. ," 51:93-123 (1995)) have 
also been characterized in some detail as has herpes virus (Boehmer et al., "Herpes 
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Simplex Virus DNA Replication," Annu. Rev. Biochem. , 66:347-384 (1997)) and 
vaccinia virus (McDonald et al., "Characterization of a Processive Form of the 
Vaccinia Virus DNA Polymerase," Virology , 234: 1 68-1 75 ( 1 997)). The helicase of E. 
coli is encoded by the dnaB gene and is called the DnaB-helicase. In phage T4, the 
helicase is the product of the gene 41 , and, in T7, it is the product of gene 4. 
Generally, the helicase contacts the DNA polymerase in E. coli. This contact is 
necessary for the helicase to achieve the catalytic efficiency needed to replicate a 
chromosome (Kim et al., "Coupling of a Replicative Polymerase and Helicase: A tau- 
DnaB Interaction Mediates Rapid Replication Fork Movement," Cell , 84:643-650 
(1996)). The identity of the helicase that acts at the replication fork in a eukaryotic . 
cellular system is still not firm. 

The primase of E. coli (product of the dnaG gene), phage T4 (product 
of gene 61), and T7 (gene 4) require the presence of their cognate helicase for activity. 
The primase of eukaryotes, called DNA polymerase alpha, looks and behaves 
differently. DNA polymerase alpha is composed of 4 subunits. The primase activity 
is associated with the two smaller subunits, and the largest subunit is the DNA 
polymerase which extends the product of the priming subunits. DNA polymerase 
alpha does not need a helicase for priming activity on single strand DNA that is not 
coated with binding protein. 

The chromosomal replicating DNA polymerase of all these systems, 
prokaryotic and eukaryotic, share the feature that they are processive, meaning they 
remain continuously associated with the DNA template as they link monomer units 
(dNTPs) together. This catalytic efficiency can be manifest in vitro by their ability to 
extend a single primer around a circular ssDNA of over 5,000 nucleotide units in 
length. Chromosomal DNA polymerases will be referred to here as replicases to 
distinguish them from DNA polymerases that function in other DNA metabolic 
processes and are far less processive. 

There are three types of replicases known thus far that differ in how 
they achieve processivity and how their subunits are organized. These will be referred 
to here as Types MIL The Type I is exemplified by the phage T5 replicase, which is 
composed of only one subunit yet is highly processive (Das et al., "Mechanism of 
Primer-template Dependent Conversion of dNTP-dNMP by T7 DNA Polymerase," J. 
Biol. Chem. , 255:7149-7154 (1980)). It is possible that the T5 enzyme achieves 
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processivity by having a cavity within it for binding DNA, with a domain of the 
protein acting as a lid that opens to accept the DNA and closes to trap the DNA inside, 
thereby keeping the polymerase on DNA during polymerization of dNTPs. Type II is 
exemplified by the replicases of phage T7, herpes simplex virus, and vaccinia virus. 
In these systems, the replicase is composed of two subunits, the DNA polymerase and 
an "accessory protein" which is needed for the polymerase to become highly efficient. 
It is presumed that the DNA polymerase binds the DNA in a groove and that the 
accessory protein forms a cap over the groove, trapping the DNA inside for processive 
action. Type IE is exemplified by the replicases of E. coli, phage T4, yeast, and 
humans in which there are three separate components, a sliding clamp protein, a 
clamp loader protein complex, and the DNA polymerase. In these systems, the sliding 
clamp protein is an oligomer in the shape of a ring. The clamp loader is a 
multiprotein complex which uses ATP to assemble the clamp around DNA. The 
DNA polymerase then binds the clamp which tethers the polymerase to DNA for high 
processivity. The replicase of the E. coli system contains a fourth component called 
tau that acts as a glue to hold two polymerases and one clamp loader together into one 
structure called Pol EI*. In this application, any replicase that uses a minimum of 
three components (i.e., clamp, clamp loader, and DNA polymerase) will be referred to 
as either a three component polymerase, a type m enzyme, or a DNA polymerase HI- 
type replicase. 

The E. coli replicase is also called DNA polymerase ITJ holoenzyme. 
The holoenzyme is a single multiprotein particle that contains all the components; it is 
comprised often different proteins. This holoenzyme is suborganized into four 
functional components called: 1) Pol m core (DNA polymerase); 2) gamma complex 
or tau/gamma complex (clamp loader); 3) beta subunit (sliding clamp); and 4) tau 
(glue protein). The DNA polymerase ITJ "core" is a tightly associated complex 
containing one each of the following three subunits: 1 ) the alpha subunit is the actual 
DNA polymerase (129 kDa); 2) the epsilon subunit (28 kDa) contains the 
proofreading 3'-5' exonuclease activity; and 3) the theta subunit has an unknown 
function. The gamma complex is the clamp loader and contains the following 
subunits: gamma, delta, delta prime, chi and psi (U.S. Patent No. 5,583,026 to 
O'Donnell). Tau can substitute for gamma, as can a tau/gamma heterooligomer. The 
beta subunit is a homodimer and forms the ring shaped sliding clamp. These 
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components associate to form the holoenzyme and the entire holoenzyme can be 
assembled in vitro from 10 isolated pure subunits (U.S. Patent No. 5,583,026 to 
O'Donnell; U.S. Patent No. 5,668,004 to O'Donnell). The E. coli dnaX gene encodes 
both tau and gamma. Tau is the product of the full gene. Gamma is the product of the 
5 first 2/3 of the gene; it is truncated by an efficient translational frameshift that results 

in incorporation of one unique residue followed by a stop codon. 

The tau subunit, encoded by the same gene that encodes gamma 
(dnaX), also acts as a glue to hold two cores together with one gamma complex. This 
subassembly is called DNA polymerase m star (Pol HI*). One beta ring interacts 

10 with each core in Pol ID* to form DNA polymerase ETJ holoenzyme. 

During replication, the two cores in the holoenzyme act coordinately to 
synthesize both strands of DNA in a duplex chromosome. At the replication fork, 
DNA polymerase m holoenzyme physically interacts with the DnaB helicase through 
the tau subunit to form a yet larger protein complex termed the "replisome" (Kim et 

15 al., "Coupling of a Replicative Polymerase and Helicase: A tau-DnaB Interaction 

Mediates Rapid Replication Fork Movement," Cell, 84:643-650 (1996); Yuzhakov et 
al., "Replisome Assembly Reveals the Basis for Asymmetric Function in Leading and 
Lagging Strand Replication," Cell, 86:877-886 (1996)). The primase repeatedly 
contacts the helicase during replication fork movement to synthesize RNA primers on 

20 the lagging strand (Marians, K.J., "Prokaryotic DNA Replication," Annu. Rev. 

Biochem. , 6 1 :673-7 19(1 992)). 

Intensive subtyping of prokaryotic cells has now lead to a taxonomic 
classification of prokaryotic cells as eubactena (true bacteria) to distinguish them 
from archaebacteria. Within eubacteria are many different subcategories of cells, 

25 although they can broadly be subdivided into Gram positive - and Gram negative-like 

cells. Numerous complete and partial genome sequences of prokaryotes have 
appeared in the public databases. 

In the present invention, new genes from the Gram positive bacteria, 
Streptococcus pyogenes (e.g., S. pyogenes) and Staphylococcus aureus (e.g., S. 

30 aureus) are identified. They are assigned names based on their nearest homology to 

subunits in the E. coli system. The genes encoding E. coli replication proteins are as 
follows: alpha {dnaE); epsilon (dnaQ); theta (holE); tau (full length dnaX); gamma 
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(frameshift product of dnaX); delta (hoi A); delta prime (holB); chi (holQ; psi (holD); 
beta (dnaN); DnaB helicase (dnaB); and pnmase (dnaG). 

Study of the organisms for which a complete genome sequence is 
available reveals that no organism has identifiable homologues to all the subunits of 
the E. coh three component polymerase, Pol HI holoenzyme (see Table 1 below). All 
other organisms lack the 6 subunit (holE), and all except one lack genes encoding 
the x and y subunits (holC and holD, respectively) as judged by sequence comparison 
searches. Further, the a and e subunits are fused into one large a subunit in some 
organisms (e.g., Gram positive cells) as detailed in (Sanjanwala et al., "DNA 
Polymerase ffl Gene of Bacillus subtilis" Proc. Natl. Acad. Sci., USA , 86:4421-4424 
(1989)). Although all organisms have homologues to t, (5, 5' and SSB, the 5 subunit has 
diverged significantly (either not recognized or nearly not recognized by gene 
searching programs), perhaps even to the point where it is no longer involved in DNA 
replication. The DnaX product also would appear to lack frameshift signals in most 
organisms. This predicts only one protein (tau) will be produced from this gene, 
instead of two as in E. coli. Indeed, this has been shown to be true for the 
Staphylococcus aureus DnaX (U.S. Patent Application Serial No. 09/235,245, which is 
hereby incorporated by reference). Finally, genetic study of Bacillus subtilis identified 
two genes that do not have counterparts in E. coli (dnaB, not the helicase, and dnaH) as 
well as one other gene, dnal, that is only very distantly related to E. coli dnaC 
(Karamata et al., "Isolation and Genetic Analysis of Temperature-Sensitive Mutants of 
B. subtilis Defense in DNA Synthesis," Molec. Gen. Genet. . 108:277-287 (1970); 
Braund et al., "Nucleotide Sequence of the Bacillus subtilis dnaD Gene," Microb ., 
141 :321-322 (1995); Hoshino et al., "Nucleotide Sequence of Bacillus subtilis dnaB: A 
Gene Essential for DNA Replication Initiation and Membrance Attachment," Proc. 
Natl. Acad. Sci. USA ." 84:653-657 (1987)). Keeping in mind the apparently random, 
or at least unpredictable process of evolution, it is possible that these apparently new 
genes perform novel functions that may result in a new type of polymerase for 
chromosomal replication. Thus, it seems possible that new proteins may have evolved 
to take the place of x, 9, the frameshift product of DnaX, and possibly 5 in other 
eubacteria. These considerations indicate that the three component polymerase of 
different eubacteria may have different structures. That this may be so would not be 
surprising as different bacteria are often less related evolutionarily than plants are to 
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humans. For example, the split between Gram positive and Gram negative bacteria 
occurred about 1 .2 billion years ago. This distant split makes Gram positive cells an 
attractive source to examine how different other eubacterial three component 
polymerases are from the E. coli Pol HI holoenzyme. 

Table 1 

Organism (Order) 

Escherichia coli 

Proteobacteria 
Haemophilus influenzae 

Proteobacteria 
Mycoplasma genitalium 

Firmicutes 
Synichisystis sp. 

Cyanobacteria 
Bacillus subtilis 

Firmicutes 
Borrelia burgdorferi 

Spirochaetales 
Aquifex aeolicus 

Aquificales 
Mycobacterium tuberculosis 

Firmicutes & Actinobactcria 
Treponema pallidum 

Spirochaetales 
Chlamydia trachomatis 

Chlamydiales 
Rickettsia prowazekii 

Proteobacteria 
Helicobacter pylori 

Proteobacteria 
Thermatoga maritima 

Thcrmotogales 

5 

The goal of this invention is to learn how to form a functional three 
component polymerase from an organism that is highly divergent from E. coli and 
whether it is as rapid and processive as the E. coli Pol m holoenzyme. Namely, from 
bacteria lacking x, v, or e, or having a widely divergent 6 subunit, or having only one 
10 DnaX product, or an a subunit that encompasses both a and e activities. All 

eubacteria for which the entire genome has been sequenced have at least one of these 
differences from E. coli. Many Gram negative bacteria have one or more of these 
differences (e.g., Haemophilus influenzae and Aquifex aeolicus ). Bacteria of the 
Gram positive class have all of these different features. Because of the distant 
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evolutionary split between Gram positive and Gram negative bacteria, their 
mechanisms of replication may have diverged significantly as well. Indeed, 
purification of the replication polymerase from B. subtilis, a Gram positive cell, gives 
only a single subunit polymerase (Barnes et al., "Purification of DNA Polymerase UJ 
5 of Gram-Positive Bacteria," Methods Enzy. 262:35-42 (1995); Barnes et al., 

"Antibody to B. subtilis DNA Polymerase HI: Use in Enzyme Purification and 
Examination of Homology Among Replication-specific DNA Polymerases," Nucl. 
Acids Res. , 6:1203-209 (1979); Barnes et al., "DNA Polymerase III of Mycoplasma 
pulmonis: Isolation and Characterization of the Enzyme and its Structural Gene, 
10 polC," Mol. Microb. , 13:843-854, (1994); Low et al., "Purification and 

Characterization of DNA Polymerase HI from Bacillus subtilis" J. Biol. Chem. , 
251:131 1-1325 (1976)) instead of a 10 subunit assembly containing the three 
components of a rapidly processive machine as discussed above for Pol EI 
holoenzyme from E. coli. This finding suggests a different structural organization of 
15 the replicase and possibly different functional characteristics as well. 

Although there are many studies of replication mechanisms in 
eukaryotes and, specifically, the Gram negative bacterium E. coli and its 
bacteriophages, there is very little information about how Gram positive organisms 
replicate. The Gram positive class of bacteria includes some of the worst human 
20 pathogens such as Staphylococcus aureus, Streptococcus pneumoniae, Streptococcus 

pyogenes, Enterococcus faecalis, and Mycobacterium tuberculosis (Youmans et al., 
The Biological and Clinical Basis of Infectious Disease (1985 )). Until this invention, 
the best characterized Gram positive organism for chromosomal DNA synthesis was 
Bacillus subtilis. Fractionation of B. subtitis has identified three DNA polymerases. 
25 (Gass et al, "Further Genetic and Enzymological Characterization of the Three 

Bacillus subtilis Deoxyribonucleic Acid Polymerases," J. Biol. Chem. , 248:7688-7700 
(1973); Ganesan et al., "DNA Replication in a Polymerase I Deficient Mutant and the 
Identification of DNA Polymerases H and HI in Bacillus subtilis," Biochem. Biophys. 
Res. Commun. , 50:155-163 (1973)). These polymerases are thought to be analogous 
30 to the three DNA polymerases of E. coli (DNA polymerases I, II, and HI). Studies in 

B. subtilis have identified a polymerase that appears to be involved in chromosome 
replication and is termed Pol m (Ott et al., "Cloning and Characterization of the polC 
Region of Bacillus subtilis," J. Bacteriol. . 165:951-957 (1986); Barnes et al., 



WO 01/09164 



-9- 



PCT/L'SOO/20666 



"Localization of the Exonuclease and Polymerase Domains of Bacillus subtilis DNA 
Polymerase in," Gene , 1 11:43-49 (1992); Barnes et al., 'The3'-5' Exonuclease Site 
of DNA Polymerase EH From Gram-positive Bacteria: Definition of a Novel Motif 
Structure," Gene " 165:45-50 (1995) or Barnes et al., "Purification of DNA 
Polymerase HI of Gram-positive Bacteria," Methods in Enzv. , 262:35-42 (1995)). The 
B. subtilis Pol HJ (encoded by polQ is larger (about 1 65 kDa) than the E. coli alpha 
subunit (about 129 kDa) and exhibits 3 '-5' exonuclease activity. The polC gene 
encoding this Pol EH shows weak homology to the genes encoding E. coli alpha and 
the E. coli epsilon subunit. Hence, this long form of the B. subtilis Pol HI (herein 
referred to as a -large or Pol EI-L) essentially comprises both the alpha and epsilon 
subunits of the E. coli core polymerase. The S aureus a -large has also been 
sequenced, expressed in E. coli, and purified; it contains DNA polymerase and 3 '-5' 
exonuclease activity (Pacini et al., "Characterization and Overexpression of the Gene 
Encoding Staphylococcus aureus DNA Polymerase EH," Gene , 165:51-56 (1995)). 
Although a -large is essential to cell growth (Clements et al., "Inhibition of Bacillus 
subtilis Deoxyribonucleic Acid Polymerase EH by Phenylhydrazinopyrimidines : 
Demonstration of a Drug-induced Deoxyribonucleic Acid-Enzyme Complex," J. Biol. 
Chem. . 250:522-526 (1975); Cozzarelli et al., "Mutational Alteraction of Bacillus 
subtilis DNA Polymerase EH to Hydroxyphenylazopyrimidine Resistance: Polymerase 
EH is Necessary for DNA Replication," Biochem. And Biophv. Res. Commun. . 
51:151-157 (1973); Low et al., "Mechanism of Inhibition of Bacillus subtilis DNA 
Polymerase EH by the Arylhydrazinopyrimidine Antimicrobial Agents," Proc. Natl. 
Acad. Sci. USA . 71 :2973-2977 (1974)), there could still be another DNA 
polymcrase(s) that is essential to the cell, such as occurs in yeast (Morrison et al., "A 
Third Essential DNA Polymerase in S. cerevisiae" Cell, 62:1143-1151 (1990)). 

Purification of a -large from B. subtilis results in only this single 
protein without associated proteins (Barnes et al., "Localization of the Exonuclease 
and Polymerase Domains of Bacillus subtilis DNA Polymerase III," Gene, 1 1 1 :43-49 
(1992); Barnes et al., "The 3 '-5' Exonuclease Site of DNA Polymerase m From 
Gram-positive Bacteria: Definition of a Novel Motif Structure," Gene " 165:45-50 
(1995) or Barnes et al., "Purification of DNA Polymerase HI of Gram-positive 
Bacteria," Methods in Enzvmol.. 262:35-42 (1995)). Hence, it is possible that a -large 
is a member of the Type I replicase (like T5) in which it is processive on its own 
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without accessory proteins. B. subtilis and S. aureus also have a gene encoding a 
protein that has approximately 30% homology to the beta subunit of E. coli; however, 
the protein product has not been purified or characterized (Alonso et al., "Nucleotide 
Sequence of the recF Gene Cluster From Staphylococcus aureus and 
5 Complementation Analysis in Bacillus subtilis recF Mutants," Mol. Gen. Genet. . 

246:680-686 (1995); Alonso et al., "Nucleotide Sequence of the recF Gene Cluster 
From Staphylococcus aureus and Complementation Analysis in Bacillus subtilis recF 
Mutants," Mol. Gen. Genet. . 248:635-636 (1995)). Whether this beta subunit has a 
function in replication, a ring shape, or functions as a sliding clamp was not known 
10 until recently. It was also not known whether it is functional with a -large. Recently, 

it was shown that S, aureus (i is functional as a ring, and that it also functions with a - 
large (U.S. Patent Application Serial No. 09/235,245, which is hereby incorporated by 
reference). Further, a fourth DNA polymerase was identified with greater homology 
to E. coli a than a -large. This polymerase, called herein a -small, is shorter than a - 
15 large and lacks the domain homologous to epsilon. This polymerase also functions 

with the p ring, indicating that it may participate in chromosome replication. Indeed, a 
recent report indicates that a -small is essential for replication in Streptomyces 
coelicolor A3 (2) (Flett et al., "A Gram-negative type' DNA Polymerase m is Essential 
for Replication of the Linear Chromosome of Streptomyces Coelicolor A3(2)," Mol. 
20 Micro. . 31:949-958. (1999)). 

As described earlier, purification of the replicase from the Gram 
positive B. subtilis gives only a single subunit Pol m, instead of a multicomponent 
complex. Also, S. aureus dnaXhas been shown to encode only one subunit (U.S. 
Patent Application Serial No. 09/235,245, which is hereby incorporated by reference). 
25 Moreover, S. aureus and B. subtilis lack homologues to x, V, 6, and the 5 subunit is 

only weakly homologous to 5 of E. coli (only 28%). Further, they lack a homologue to 
dnaQ encoding e. Instead, they contain this activity (3 '-5' exonuclease) in the polC 
gene product which provides the a -large form of a. The e subunit is needed for high 
speed and processivity of the E. coli Pol EI holoenzyme; the a subunit alone is much 
30 less rapid and processive with the p ring compared to the presence of both 

a and e (Studwell et al., "Processive Replication is Contingent on the Exonuclease 
Subunit ofDNA Polymerase IE Holoenzyme," J. Biol Chem . 265: 1171-1178(1990)). 
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Studies using the E. coli P ring (and y complex) show they confer onto 
S. aureus a quite efficient synthesis (U.S. Patent Application Serial No. 09/235,245, 
which is hereby incorporated by reference), but the efficiency is not equal to that of E. 
coli at with p (and y complex). This may be due to use of the heterologous 
combination of an a subunit from one organism (5". aureus) with the p clamp from 
another (£. coli.). However, it is also possible that S. aureus a simply does not 
function with a p clamp to produce speed and processivity comparable to the E. coli 
polymerase. Also, as described earlier, the a -large subunit of B. subtilis purifies as a 
single subunit, rather than associated with accessory subunits assembled into the three 
components of a rapid, processive machine (i.e., like E. coli Pol HI holoenzyme). The 
lack of two DnaX products, lack of a multicomponent structure, and lack of gene 
homologues encoding several subunits of the three component, Pol m, of E. coli brings 
into question whether other types of bacteria, such as Gram positive cells, even have an 
enzyme with similar structure or comparable speed and processivity to that found in 
the Gram negative E. coli. 

The lack of gene homologues encoding several subunits of the E. coli 
three component polymerase creates uncertainties with respect to reconstructing a 
rapid and processive polymerase from a Gram positive cell that has characteristics like 
the Pol m system of E. coli. 

The y and 5' proteins are homologous to one another, encoding C-shape 
proteins (Dong et al., "DNA Polymerase III Accessory Proteins," J. Biol. Chem , 
268:1 1758-1 1765, (1993); Guenther et al., "Crystal Structure of the 8' Subunit of the 
Clamp-loader Complex of E. coli DNA Polymerase ID," Cell, 91:335-345 (1997)). 
The clamp loaders of yeast and humans are composed of five proteins, all of which are 
homologous to one another and to y and 5' (Cullman et al., "Characterization of the 
Five Replication Factor C Genes of Saccharomyces Cerevisiae," Mol. Cell. Biol. , 
15:4661-4671 (1995)). This provides evidence that a clamp loader can be composed 
entirely of C-shape proteins. Perhaps the Gram positive DnaX-protein (hereafter 
referred to as t) and 6' are sufficient to provide function as a clamp loader. Indeed, the 
clamp loader of T4 phage is composed of only two different proteins, gp44/62 
complex (Young et al., "Structure and Function of the Bacteriophage T4 DNA 
Polymerase Holoenzyme," Biochem. , 31:8675-8690 (1992)). This idea is also 
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supported by the presence of only two RFC genes in archaebacteria, suggesting that 
they may utilize two C-shaped proteins for clamp loading, in contrast to yeast and 
humans that use five. With this consideration in mind, genes were identified and 
isolated and the x protein (encoded by dnaX) and 5' (encoded by holB) of another 
5 Gram positive organism, Streptococcus pyogenes, were expressed and purified. As 

was observed in S. aureus, S. pyogenes dnaX produces only a single polypeptide. The 
p, encoded by dnaN of S. pyogenes, was also identified, expressed, and purified, as 
were the a -large subunit encoded by polC and the SSB encoded by the ssb gene. 
These proteins were studied for interactions and characterized for their effect on cc- 

10 large. However, the hypothesis was incorrect as t and 5' did not form a t5' complex, 

nor did they assemble p onto DNA or provide stimulation of a when using p on 
primed and SSB coated M13mpl8 ssDNA. 

In light of the inability of S. pyogenes t protein and 5' to function as a 
clamp loader, it seemed reasonable to expect that one or more other proteins are 

15 needed. The fact that E. coli has some replicase subunits that other bacteria do not, 

suggests that other bacteria may have some replicase subunits that E. coli does not. 
Indeed, genetic studies of Bacillus subtilis demonstrates that it has three genes needed 
for replication that E. coli does not have. Two of these novel genes, called dnaB (not 
the same as E. coli dnaB encoding the helicase) and dnaH, have no significant 

20 homology to genes in the E. coli genome database (Bruand et al., "Nucleotide 

Sequence of the Bacillus subtilis dnaD gene," Microbiol ., 1 41 :32 1 -322 (1 995); 
Hoshino et al., "Nucleotide Sequence of Bacillus subtilis dnaB: A gene Essential for 
DNA replication Initiation and Membrane Attachment," Proc. Natl. Acad. Sci. USA. 
84:653-657 (3987)). Further, dnal of B. subtilis is important for replication and has, 

25 at best, a very limited homology to E. coli dnaC (Karamata et al., "Isolation and 

Genetic Analysis of Temperature-Sensitive Mutants of B. subtilis Defective in DNA 
synthesis," Molec. Gen. Genetics . 108:277-287 (1970)). Perhaps one or more of these 
genes encode the proteins(s) necessary to provide clamp loading activity when 
combined with t and 6', or to couple with a to provide it with speed and/or 

30 processivity as the E. coli epsilon does. The S. pyogenes homologues of B. subtilis 

dnal, dnaH, and dnaB were identified, cloned, and the encoded proteins were 
expressed and purified. However, these proteins failed to provide activity alone or in 
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combinations with S. pyogenes t and 6' in loading 5 1 . pyogenes p onto DNA, or in 
stimulating £. pyogenes a -large in combination with p, t, and 5' on SSB coated 
primed M 1 3mp 1 8 ssDNA. 

Weak homology exists for the holA gene among prokaryotes. This 
5 weak homologue of holA was identified in S. pyogenes and, then, it was cloned, 

expressed, and the putative 6 was purified. The putative 5 formed an isolatable 
complex with t and 5'. In fact, the t65' complex loaded S. pyogenes p onto DNA, and 
it stimulated S. pyogenes a -large in a p dependent reaction on primed SSB coated 
M13mpl 8 ssDNA. Hence, this protein was the only missing component necessary to 

10 provide clamp loading activity. Further, a mixture of a with t65', followed by ion 

exchange chromatography on MonoQ, indicated formation of an ca5S' complex. 
Consistent with this, t appeared to bind a in gel filtration analysis. 

Whether the S. pyogenes three component polymerase can synthesize 
DNA in as rapid and processive of a fashion as the E. coli Pol m holoenzyme three 

15 component polymerase is very difficult to predict, because no other DNA polymerase 

known to date catalyzes synthesis at the rate or processivity of the E. coli three 
component polymerase. For example, the three component T4 phage polymerase 
travels about 400 nucleotides/s, the yeast DNA polymerase delta three component 
polymerase travels about 120 nucleotides/s, and the human DNA polymerase delta 

20 three component enzyme appears slower and less processive than the yeast enzyme. 

The standard test for these speed and processivity characteristics is 
examination of a time course in extension of a primer on a very long template, such as 
around the 7.2 kb Ml 3mpl 8 ssDNA genome coated with SSB and primed with a 
synthetic DNA oligonucleotide. The results of experiments of this type demonstrate 

25 that the three component S. pyogenes polymerase is indeed extremely rapid in 

synthesis. Surprisingly, it is just as fast as the E. coli enzyme. Extension proceeds at 
about 700-800 nucleotides per second, completing the entire template in about 9 
seconds. The enzyme was fully processive throughout replication of the Ml3mpl 8 
genome, as could be determined from the fact that some templates were not extended 

30 at all, while others were extended to completion. If the enzyme had not been 

processive during the entire replication reaction, then when it comes off one partially 
extended DNA genome it would have reassociated with the unextended DNA that 
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remained and partially replicated it as well (and so on until the entire population of 
DNA became fully replicated). This did not happen. Instead, the reaction showed a 
mixture of completely replicated templates and templates that were still untouched 
starting material. This indicates that the enzyme stays with a template until it 
5 completes it before it cycles over to replicate another one (i.e., it is highly 

processive). Each of the five proteins, a, t, 6, 5' and p, are needed to obtain this rapid 
and processive DNA synthesis. 

This invention has provided an intellectual template by which the 
clamp loader component of these three component polymerases can be obtained from 

10 any eubacterial prokaryotic cell and how to use it with the other components to 

produce a rapid and processive polymerase. All prokaryotes in the eubacterial 
kingdom that have been sequenced to date contain homologues of these genes. As the 
process of lateral gene transfer appears to be a major force in evolution, it would 
appear that relatedness of enzymes and enzyme machines is best judged by 

15 comparisons of their genes and proteins rather than by phylogeny of which bacteria 

they are in (Doolittle et al., "Archaeal Genomics: Do Archaea have a Mixed 
Heritage?," Curr. Biol. , 8:R209-R21 1 (1998)). As pointed out earlier in this 
application, most bacteria have genetic characteristics of replication genes/proteins of 
S. pyogenes rather than that of E. coli (i.e., no genes encoding x,v> or 9, only a weak 

20 homolog to 6, or a dnaX gene encoding only a single protein). 

The dnaX gene encoding t and y in E. coli encodes only one protein in 
some organisms, but, as this application shows, it is still functional in forming a 
protein complex capable of rapid and processive DNA synthesis. In addition, this 
application shows that the delta subunit, which is only weakly homologous among 

25 different prokaryotic organisms, is an essential functional subunit of the three 

component polymerase (instead of having diverged so as to fulfill an entirely different 
function in some other intracellular process). As mentioned earlier, several genes 
encoding subunits of the E. coli clamp loader (y complex; y.S.S'.x.v) are not obviously 
present in other prokaryotes (holC and holD encoding x and v ). Hence, one may 

30 anticipate that other genes may have evolved to encode new subunits that replace 

these, and that these new subunits may have been essential to the activity of the clamp 
loader. For example, they may have either taken over some of the functionality of 
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another subunit, or structurally (e.g., the physical presence of a subunit could be 
needed for one subunit to assume its proper and active conformation, or for one or 
more of the subunits to form a complex together to yield the multisubunit clamp 
loader assembly). In addition, this application shows that the a subunit (polC gene 
5 product) is sufficient for rapid and processive synthesis with the other two 

components (i.e., E. coli requires z submit to bind to a for rapid and processive 
synthesis of a with the p clamp). Finally, this application shows that the 5. pyogenes 
three component polymerase synthesizes DNA as fast as the E. coli Pol HI three 
component polymerase. Up to this point, the E. coli Pol HI three component 
10 polymerase was over twice the speed of the T4 enzyme and over 5 times the speed of 

others. Hence, it was possible that E. coli may have been unique among prokaryotes 
in having a polymerase that achieves such speed. This invention shows that this is not 
the case. Instead, this speed in polymerization generalizes to the Gram positive 
prokaryotic three component DNA polymerases. It may be presumed, now that two 
15 examples of three component polymerases in widely divergent bacteria share the 

charactistics of rapid, processive synthesis, that the three component polymerase of 
other eubacteria will also be rapid and processive. 

These rapid and processive three component DNA polymerases can be 
applied to several important uses. DNA polymerases currently in use for DNA 
20 sequencing and DNA amplification use enzymes that are much slower and thus could 

be improved upon. This is especially true of amplification as the three component 
polymerase is capable of speed and high processivity making possible amplification of 
very long (tens of Kb to Mb) lengths of DNA in a time efficient manner. These three 
component polymerases also function in conjunction with a replicative helicase 
25 (DnaB) and, thus, are capable of amplification at ambient temperature using the 

helicase to melt the DNA duplex. This property could be useful in amplification 
reaction procedures such as in polymerase chain reaction (PCR) methodology. 
Finally, these three component polymerases and their associated helicase (DnaB) and 
primase (DnaG) are attractive targets for antibiotics due to their essential and central 
30 role in cell viability. 

This application provides a three component polymerase from two 
human pathogens in the Gram positive class. It makes possible the production of this 
three component polymerase from other bacteria of the Gram positive type (e.g., 
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Streptococci, Staphylococci, Mycoplasma) and other types of bacteria 
lacking x,W, or 6, those having only one protein produced by their dnaX gene such as 
obligate intracellular parasites, Mycoplasmas (possibly evolved from Gram positives), 
Cyanobacteria (Synechocystis), Spirochaetes such as Borrelia and Treponemia and 
Chlamydia, and distant relatives of E. coli in the Gram negative class (e.g., Rickettsia 
and Helicobacter). These three component polymerases are useful in manipulation of 
nucleic acids for research and diagnostic purposes (e.g., sequencing and amplification 
methods) and for screening chemicals for antibiotic activity (useful in human or 
animal therapy and agriculture such as animal feed supplements). There are several 
assays described previously in U.S. Patent Application Serial No. 09/235,245 to 
O'Donnell et al., which is hereby incorporated by reference, that use these three 
component polymerases (or subassemblies), as well as the DnaB and DnaG 
homologues, either alone or in various combinations, for the purpose of screening 
chemicals, such as chemical libraries, for inhibitor activity. Such inhibitors can be 
developed further (usually by chemical manipulation and alteration) into lead 
compounds and then into full fledged pharmaceuticals. 

There remains a need to understand the molecular details of the process 
of DNA replication in other cells that are quite different from E. coli, such as in Gram 
positive cells. It is possible that a more detailed understanding of replication proteins 
will lead to discovery of new antibiotics. Therefore, a deeper understanding of 
replication proteins of Gram positive bacteria is especially important given the 
emergence of drug resistant strains of these organisms. For example, Staphylococcus 
aureus has successfully mutated to become resistant to all common antibiotics. 

The "target" protein(s) of an antibiotic drug is generally involved in a 
critical cell function, such that blocking its action with a drug causes the pathogenic 
cell to die or no longer proliferate. Current antibiotics are directed to very few targets. 
These include membrane synthesis proteins (e.g., vancomycin, penicillin, and its 
derivatives such as ampicillin, amoxicillin, and cephalosporin), the ribosome 
machinery (e.g., tetracycline, chloramphenicol, azithromycin, and the aminoglycosides 
such as kanamycin, neomycin, gentamicin, streptomycin), RNA polymerase (e.g., 
rifampimycin), and DNA topoisomerases (e.g., novobiocin, quinolones, and 
fluoroquinolones). The DNA replication apparatus is a crucial life process and, thus, 
the proteins involved in this process are good targets for antibiotics. 
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A powerful approach to discovery of a new drug is to obtain a target 
protein, characterize it, and develop in vitro assays of its cellular function. Large 
chemical libraries can then be screened in the functional assays to identify compounds 
that inhibit the target protein. These candidate pharmaceuticals can then be 
5 chemically modified to optimize their potency, breadth of antibiotic spectrum, non- 

toxicity, performance in animal models and, finally, clinical trials. The screening of 
large chemical libraries requires a plentiful source of the target protein. An abundant 
supply of protein generally requires overproduction techniques using the gene 
encoding the protein. This is especially true for replication proteins as they are 
10 present in low abundance in the cell. 

Selective and robust assays are needed to screen reliably a large 
chemical library. The assay should be insensitive to most chemicals in the 
concentration range normally used in the drug discovery process. These assays should 
also be selective and not show inhibition by antibiotics known to target proteins in 
15 processes outside of replication. 

The present invention is directed to overcoming these deficiencies in 

the art. 

SUMMARY OF THE INVENTION 

20 

The present invention relates to various isolated DNA molecules from 
Staphylococcus aureus and Streptococcus pyogenes, both of which are Gram positive 
bacteria. These include DNA molecules which include a coding region from the dnaE 
gene (encoding a- small), dnaX gene (encoding tau), polC gene (encoding Pol m -L 

25 or a- large), dnaN gene (encoding beta), holA gene (encoding delta), holB gene 

(encoding delta prime), ssb gene (encoding SSB), dnaB gene (encoding DnaB), and 
dnaG gene (encoding DnaG) of S. aureus and/or S. pyogenes. These DNA molecules 
can be inserted into an expression system and used to transform host cells. The 
isolated proteins or polypeptides encoded by these DNA molecules, and their ability to 

30 function when used in combination is also disclosed. The resulting actions provide 

assembling a ring onto DNA via a clamp loader, and polymerase activity dependent 
on this ring that is rapid and processive. 



WO (11/09164 



-18- 



PCT/l' SOU/20666 



A further aspect of the present invention relates to a method of 
identifying compounds which inhibit activity of a polymerase product of polC or 
dnaE. This method is carried out by forming a reaction mixture comprising a primed 
DNA molecule, a polymerase product of polC or dnaE, a candidate compound, a 
5 dNTP, and optionally either a beta subunit, a tau complex, or both the beta subunit 

and the tau complex, wherein at least one of the polymerase product of polC or dnaE, 
the beta subunit, the tau complex, or a subunit or combination of subunits thereof is 
derived from a Eubacteria other than Escherichia coli; subjecting the reaction mixture 
to conditions effective to achieve nucleic acid polymerization in the absence of the 

10 candidate compound; analyzing the reaction mixture for the presence or absence of 

nucleic acid polymerization extension products; and identifying the candidate 
compound in the reaction mixture where there is an absence of nucleic acid 
polymerization extension products. 

The present invention deciphers the structure and mechanism of the 

15 chromosomal replicase of Gram positive bacteria and other bacteria lacking holC, 

holD, holE or dnaQ genes, or having a dnaXgene that encodes only one protein. 
Rather than use a DNA polymerase that attains high efficiency on its own, or with one 
other subunit, the Gram positive bacteria replicase is a three component type of 
replicase (class IH) that uses a sliding clamp protein. The Gram positive bacteria 

20 replicase also uses a clamp loader component that assembles the sliding clamp onto 

DNA. This knowledge, and the enzymes involved in the replication process, can be 
used for the purpose of screening for potential antibiotic drugs. Further, information 
about chromosomal replicases maybe useful in DNA sequencing, DNA amplification, 
polymerase chain reaction, and other DNA polymerase related techniques. 

25 The present invention identifies two DNA polymerases (both of Pol III 

type) in Gram positive bacteria that utilize the sliding clamp and clamp loader. The 
present invention also identifies a gene with homology to the alpha subunit of E. coli 
DNA polymerase HI holoenzyme, the chromosomal replicase of E. coli. These DNA 
polymerases can extend a primer around a large circular natural template when the 

30 beta clamp has been assembled onto the primed ssDNA by the clamp loader or a 

primer on a linear DNA where the beta clamp may assemble by itself by sliding over 
an end. 
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The present invention shows that the clamp and clamp loader 
components of Gram negative cells can be exchanged for those of Gram positive cells 
in that the clamp, once assembled onto DNA, will function with Pol ITT obtained from 
either Gram positive and Gram negative sources. This result implies that important 
contacts between the polymerase and clamp have been conserved during evolution. 
Therefore, these "mixed systems" may provide assays for an inhibitor of this 
conserved interaction. Such an inhibitor may be expected to shut down replication, 
and since the interaction is apparently conserved across the evolutionary spectrum 
from Gram positive and Gram negative cells, the inhibitor may exhibit a broad 
spectrum of antibiotic activity. 

The present invention demonstrates that Gram positive bacteria contain 
a beta subunit that behaves as a sliding clamp that encircles DNA. A dnaX gene 
sequence encoding a protein homolog of the gamma/tau subunit of the clamp loader 
(gamma/tau complex) E. coli DNA polymerase ITI holoenzyme is also identified. The 
presence of this gene confirms the presence of a clamp loading apparatus in Gram 
positive bacteria that will assemble beta clamps onto DNA for the DNA polymerases. 

This application also outlines methods and assays for use of these 
replication proteins in drug screening processes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the construction of the S. aureus Pol HT-L expression 
vector. The gene encoding Pol ITJ-L was cloned into a pETl 1 expression vector in a 
three step cloning scheme as illustrated. 

Figures 2A-C describe the expression and purification of S. aureus Pol 
IH-L (alpha- large). Figure 2A compares E. coli cells that contain the pETl lPolC 
expression vector that are either induced or uninduced for protein expression. The gel 
is stained with Coomassie Blue. The induced band corresponds to the expected 
molecular weight of the S. aureus Pol IU-L, and is indicated to the right of the gel. 
Figure 2B shows the results of the MonoQ chromatography of a lysate of E. coli 
(pETl lPolC-L) induced for Pol ITT-L. The fractions were analyzed in a Coomassie 
Blue stained gel (top) and for DNA synthesis (bottom). Fractions containing Pol IU-L 
are indicated. In Figure 2C, fractions containing Pol III-L from the MonoQ column 
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were pooled and chromatographed on a phosphocellulose column. This shows an 
analysis of the column fractions from the phosphocellulose column in a Coomassie 
Blue stained polyacrylamide gel. The position of Pol HI-L is indicated to the right. 

Figure 3 shows the S. aureus heta expression vector. The dnaN gene 
5 was amplified from S. aureus genomic DNA and cloned into the pET16 expression 

vector. 

Figures 4A-C illustrate the expression and purification of S. aureus 
beta. Figure 4A compares E. coli cells that contain the pET16beta expression vector 
that are either induced or uninduced for protein expression. The gel is stained with 

10 Coomassie Blue. The induced band corresponds to the expected molecular weight of 

the S. aureus beta, and is indicated to the right of the gel. The migration position of 
size standards are indicated to the left. Figure 4B shows the results of MonoQ 
chromatography of an E. coli (pET16beta) lysate induced for beta. The fractions were 
analyzed in a Coomassie Blue stained gel, and fractions containing beta are indicated. 

15 In Figure 4C, fractions containing beta from the MonoQ column were pooled and 

chromatographed on a phosphocellulose column. This shows an analysis of the 
column fractions from the phosphocellulose column in a Coomassie Blue stained 
polyacrylamide gel. The position of beta is indicated to the right. 

Figures 5A-B demonstrate that the S. aureus beta stimulates 5". aureus 

20 Pol ni-L and E. coli Pol III core on linear DNA, but not circular DNA. In Figure 5A, 

the indicated proteins were added to replication reactions containing polydA-oligodT 
as described in the Examples infra. Amounts of proteins added, when present, were: 
lanes 1 ,2: S. aureus Pol ITI-L, 7.5 ng; S. aureus beta, 6.2 pig; Lanes 3,4: E. coli Pol ITJ 
core, 45 ng; S. aureus beta, 9.3 pg; Lanes 5,6: E. coli Pol HI core, 45 ng; E. coli beta, 

25 5pg. Total DNA synthesis was: Lanes 1-6: 4.4, 30.3, 5.1, 35.5, 0.97, 28.1 pmol, 

respectively. In Figure 5B, Lanes 1-3, the indicated proteins were added to replication 
reactions containing circular singly primed M13mpl8 ssDNA as described in the 
Examples infra. S. aureus beta, 0.8 pg; & aureus Pol ITI-L, 300 ng (purified through 
MonoQ); E. coli clamp loader complex, 1.7 pg. Results in the E. coli system are 

30 shown in Lanes 4-6. Total DNA synthesis was: Lanes 1-6: 0.6, 0.36, 0.99, 2.7, 3.5, 

280 pmol, respectively. 

Figure 6 shows that S. aureus Pol EH-L functions with E. coli beta and 
clamp loader complex on circular primed DNA. It also shows that S. aureus beta does 
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not convert Pol Ett-L with sufficient processivity to extend the primer all the way 
around a circular DNA. Replication reactions were performed on the circular singly 
primed M13mpl8 ssDNA. Proteins added to the assay are as indicated in this figure. 
The amount of each protein, when present, is: S. aureus beta, 800 ng; S. aureus Pol 
m-L, 1500 ng (MonoQ fraction 64); E. coli Pol IE core, 450 ng; E. coli beta, 100 ng; 
E. coli gamma complex, 1 720 ng. Total DNA synthesis in each assay is indicated at 
the bottom of the figure. 

Figures 7A-B show that S. aureus contains four distinct DNA 
polymerases. Four different DNA polymerases were partially purified from S. aureus 
cells. S. aureus cell lysate was separated from DNA and, then, chromatographed on a 
MonoQ column. Fractions were analyzed for DNA polymerase activity. Three peaks 
of activity were observed. The second peak was the largest and was expected to be a 
mixture of two DNA polymerases based on early studies in B. subtilis. 
Chromatography of the second peak on phosphocellulose (Figure 7B) resolved two 
DNA polymerases from one another. 

Figures 8A-B show that S. aureus has two DNA Pol Ill's. The four 
DNA polymerases partially purified from S. aureus extract, designated peaks I-TV in 
Figure 7, were assayed on circular singly primed M13mpl8 ssDNA coated with E. 
coli SSB either in the presence or absence of E. coli beta (50ng) and clamp loader 
complex (50 ng). Each reaction contained 2 p.1 of the partially pure polymerase (Peak 
1 was Mono Q fraction 24 (1 .4 ug), Peak 2 was phosphocellulose fraction 26 (0.01 6 
mg/ml), Peak 3 was phosphocellulose fraction 46 (0.18 mg/ml), and Peak 4 was 
MonoQ fraction 50 (1 ug). Figure 8A shows the product analysis in an agarose gel. 
Figure 8B shows the extent of DNA synthesis in each assay. 

Figure 9 compares the homology between the polypeptide encoded by 
dnaE of S. aureus and other organisms. An alignment is shown for the amino acid 
sequence of the S. aureus dnaE product with the dnaE products (alpha subunits) of E. 
coli and Salmonella typhimurium. 

Figure 1 0 compares the homology between the N-terminal regions of 
the gamma/tau polypeptides of S. aureus, B. subtilis, and E. coli. The conserved ATP 
site and the cystines forming the zinc finger are indicated above the sequence. The 
organisms used in the alignment were: E. coli (GenBank); and B. subtilis. 
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Figure 1 1 compares the homology between the DnaB polypeptide of 
S. aureus and other organisms. The organisms used in the alignment were: E. coli 
(GenBank); B. subtilis; Sal.Typ., {Salmonella typhimurium). 

Figures 12A-B show the alignment of the delta subunit encoded by 
holA for E. coli and B. subtilis (Figure 12A) and for the delta subunit of B. subtilis and 
S. pyogenes (Figure 12B). Figure 12A shows ClustalW generated alignment of S. 
pyogenes (Gram positive) delta to E.coli (Gram negative) delta. Figure 12B shows 
ClustalW generated alignment of B. subtilis (Gram positive) delta to S. pyogenes 
(Gram positive) delta. 

Figure 1 3 is an image of an autoradiograph of an agarose gel analysis 
of replication products from singly primed, SSB coated M 1 3mp 1 8 ssDNA using the 
reconstituted S. aureus Pol HI holozyme. Only in the presence of the t85' complex 
does a-large (PolC) function with P to replicate a full circular duplex DNA (RFIT). 

Figure 14 shows a Comassie Blue stained SDS polyacrylamide gel of 
the pure S. pyogenes subunits corresponding to alpha-large, alpha-small, dnaX gene 
product (called tau), beta, delta, delta prime, and SSB. The first lane shows the 
position of molecular weight markers. Purified proteins were separated on a 15% 
SDS-PAGE and stained with Coommassie Brilliant Blue R-250. Each lane contains 5 
microgram of each protein. Lane 1, markers; lane 2, alpha-large; lane 3, alpha-small, 
lane 4, tau subunit; lane 5, beta subunit; lane 6, delta subunit; lane 7, delta prime 
subunit; lane 8, single strand DNA binding protein. 

Figures 1 5A-C document the ability to reconstitute the t55' complex of 
S- pyogenes. Proteins were mixed and gel filtered on Superose 6, followed by analysis 
of the column fractions in a SDS polyacrylamide gel. Figure 15A shows a mixture of 
t85'. Figure 15B shows a mixture of t5. Figure 1 5C shows a mixture of t5'. 

Figures 16A-E show that the S. pyogenes x85' complex can load the S. 
pyogenes beta clamp onto (circular) DNA. Loading reactions contained 500 fin 
nicked pBSK plasmid, 500 fin either t55' complex, tau, delta, or delta prime, 1pm 32 P- 
labelled beta dimer, 8 mM MgCl2, 1 mM ATP. Reaction components were 
preincubated for 10 min at 37°C prior to loading onto 5 ml Biogel Al 5M column 
equilibrated with buffer A containing 100 mM NaCl. Figure 16A demonstrates the 
ability of x55' complex to load the beta dimer onto a nicked pBSK circular plasmid. 
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Figures 1 6B-E show the results of using either: beta alone (Figure 1 6B); 65' plus (3 
(Figure 16C); x, 6 and p (Figure 16D); x, 6' and p (Figure 16E). 

Figures 1 7A-C show that x and alpha interact. Figure 1 7 A shows the 
result of gel filtration analysis of a mixture of x with alpha-large. Gel filtration 
fractions are analyzed in a SDS polyacryl amide gel. Figures 17B and 17C show the 
results using only x or only alpha-large, respectively. Comparison of the elution 
positions of proteins shows that the positions of alpha and tau are shifted toward a 
higher molecular weight complex when they are present together. The fact they do 
not exactly comigrate may indicate that they initially are together in a complex, but 
that the complex dissociates during the time of the gel filtration experiment (over one 
half hour). 

Figures 1 8A-B document the ability to reconstitute aix5S' (pol HI*) 
complex of S. pyogenes. Proteins were mixed, preincubated for 20 min at 15°C, gel 
filtered on Superose 6, followed by analysis of the column fractions in a SDS 
polyacrylamide gel (Figure 1 8A). Proteins were loaded on a MonoQ column, then 
eluted with a linear gradient of 50-500 mM NaCl, followed by analysis of the column 
fractions in a SDS polyacrylamide gel (Figure 18B). The a L x55' complex migrates 
early. 

Figure 1 9 illustrates the speed and processivity of the S. pyogenes 
a L x55' (pol m*) complex. The a L x66' (pol HI*) complex was incubated with primed 
Ml 3pm 18 ssDNA (coated with S. pyogenes SSB) and only two dNTPs, then 
replication was initiated upon adding the remaining two dNTPs. Reactions contained 
25 fmol singly primed M13mpl8 ssDNA template, 300 fmol p 2 , and either 75 fmol or 
250 fmol a L x55'. Time points were quenched with SDS/EDTA then analyzed in a 
neutral agarose gel followed by autoradiography. Each time point is a separate 
reaction. The time course of polymerization was performed at two different ratios of 
polymerase/primed template to assess speed and processivity of nucleotide 
incorporation. 

Figures 20A-I show the extent of homology between S. pyogenes 
replication genes and other organisms. Due to the low homology of delta 
(Figure 20D), one must "walk" from one organism to the next in order to recognize 
the homologue with high probability. Percent identity over regions of the indicated 
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number of amino acid residues is shown for each match (i.e., the two organisms at the 
opposite ends of each line). Amino acid sequences were retrieved from either 
GenBank or individual unfinished genome databases. 

Figure 21 A-F are images illustrating that the S. pyogenes DnaE (alpha- 
small) polymerase functions with p\ Figures 21 A-B illustrate the relationship between 
DnaE and P for association with ssDNA. Different amounts of DnaE polymerase 
were added to a SSB coated Ml 3mpl 8 ssDNA circle primed with a single DNA 
oligonucleotide, and products were analyzed in a native agarose gel. Reactions were 
performed in the presence of t85' and either the absence (Figure 21 C, panels 1-4) or 
presence (Figure 2 ID, panels 1-4) of p\ Positions of completed duplex (RFU) and 
initial primed template (ssDNA) are indicated. Figure 21 E shows an analysis of 
exonuclease activity by PolC and DnaE on a 5'-32P-DNA 30-mer. Aliquots were 
removed at the indicated times and analyzed in a sequencing gel. Figure 21 F shows 
the effect of TMAU on PolC and DnaE in the presence of x55' and p. DNA products 
were analyzed in a native agarose gel. Positions of initial primed M 1 3mp 1 8 (ssDNA) 
and completed circular duplex (REIT) are indicated. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to various isolated nucleic acid molecules 
from Gram positive bacteria and other bacteria lacking holC, holD, or holE genes or 
having a dnaX gene encoding only one subunit. These include DNA molecules which 
correspond to the coding regions of the dnaE, dnaX, holA, holB, polC, dnaN, SSB, 
dnaB, and dnaG genes. These DNA molecules can be inserted into an expression 
system or used to transform host cells. The isolated proteins or polypeptides encoded 
by these DNA molecules and their use to form a three component polymerase are also 
disclosed. Also encompassed by the present invention are corresponding RNA 
molecules transcribed from the DNA molecules. 

These DNA molecules and proteins can be derived from numerous 
bacteria, including Staphylococcus, Streptococcus, Enterococcus, Mycoplasma, 
Mycobacterium, Borrelia, Treponema, Rickettsia, Chlamydia, Helicobacter, and 
Thermatoga. It is particularly directed to such DNA molecules and proteins derived 
from Streptococcus and Staphylococcus bacteria, particularly Streptococcus pyogenes 
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and Staphylococcus aureus (see U.S. Patent Application Serial No. 09/235,245, which 
is hereby incorporated by reference). 

The gene sequences used to obtain DNA molecules of the present 
invention were obtained by sequence comparisons with the E. coli counterparts, 
followed by detailed analysis of the raw sequence data in the contigs from the S. 
pyogenes database (http://dnal.chem.ou.edu/strep.html) or the S. aureus database 
(http://www.genome.ou.edu/staph.html) to identify the open reading frames. In many 
instances, nucleotide errors were observed causing frameshifts in the open reading 
frame thus truncating it. Therefore, upon cloning the genes via PCR, the genes were 
sequenced to obtain correct information. Also, the full nucleotide sequence of the ssb 
gene was not present in the data base. This was cloned by circular PCR and the full 
sequence is reported below. 

The 5. aureus dnaX and dnaE genes were identified by aligning genes 
of several organisms and designing primers for use in PCR to obtain a gene fragment, 
followed by steps to identify the entire gene. 

One aspect of the present invention relates to a newly discovered Pol 
in gene (herein identified as dnaE) of S. aureus whose encoded protein is homologous 
to E. coli alpha (product of dnaE gene). The partial nucleotide sequence of the S. 
aureus dnaE gene corresponds to SEQ. ID. No. 1 as follows: 



atggtggcat 






tatgatttgt 




cttaaaaata 


60 


gaagatgccg 


taagacttgc 


tgtgtctgaa 


aatgttgatg 


cacttgccat 


aactgacacc 


120 


aatgtattgt 


atggttttcc 


taaattttat 


gatgcatgta 


tagcaaataa 


cattaaaccg 


180 


atttttggta 


tgacaatata 


tgtgacaaat 


ggattaaata 




agttgttcta 


240 


gctaaaaata 


atgatggatt 


aaaagatttg 




catcggaaac 


aaaaatgaat 


300 


gcattagaac 


atgtgtcgtt 


tgaattatta 


aaacgatttt 




gattatcatt 


360 


tttaaaaaag 


tcggtgatca 


acatcgtgat 


attgtacaag 




ccataatgac 


420 


acatatatgg 




tatttcgatt 


caaggtagaa 


aacatgtttg 




480 


gtttgttacc 


aaacacgtca 


agatgccgat 


acgatttctg 


cattagcagc 


tattagagac 


540 


aatacaaaat 


tagacttaat 


tcatgatcaa 


gaagattttg 


gtgcacattt 




600 


aaggaaatta 


atcaattaga 


tattaaccaa 




cgcaggttga 


tgttatagct 


660 


caaaagtgtg 


atgcagaatt 


aaaatatcat 




ttcctcaata 


tgagacacct 


720 


aatgatgaat 


cagctaaaaa 


atatttgtgg 


cgtgtcttag 






780 


gaacttaatt 


atgacgtcta 


tttagagcga 


ttgaaatatg 






340 


atgggttttg 


aagattattt 


cttaatagta 




tccattatgc 


gaaaacgaat 


900 


gatgtgatgg 


taggtcctgg 


tcgtggttct 


tcagctggct 


cactggtcag 


ttatttattg 


960 


ggaattacaa 


cgattgatcc 


tattaaattc 


aatctattat 


ttgaacgttt 


tttaaaccca 


1020 


gaacgtgtaa 


caatgcctga 


tattgatatt 


gactttgaag 


atacacgccg 


agaaagggtc 


1080 


attcagtacg 


tccaagaaaa 


atatggcgag 


ctacatgtat 


ctggaattgt 


gactttcggt 


1140 




caagagcagt 


tgctagagat 


gttggaagaa 


ttatggggtt 


tgatgaagtt 


1200 


acattaaatg 


aaatttcaag 


tttaatccca 


cataaattag 


gaattacact 


tgatgaagca 


1260 


tatcaaattg 


acgattttaa 


agagtttgta 


catcgaaacc 


atcgacatga 


acgctggttc 


1320 


agtatttgta 


aaaagttaga 


aggtttacca 


agacatacat 


ctacacatgc 


ggcaggaatt 


1380 


actattaatg 


accatccatt 


atatgaatat 


gcccctttaa 


cgaaagggga 


tacaggatta 


1440 


ttaacgcaat 


ggacaatgac 


tgaagccgaa 


cgtattgggt 


tattaaaaat 


agattttcta 


1500 


gggttgagaa 


acttatcgat 


tattcatcaa 


atcttaacac 


aagtcaaaaa 


agatttaggt 


1560 
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attaatattg atatcgaaaa gattccgttt gatgatcaaa aagtgtttga attgttgtcg 1620 
caaggagata cgactggcat attccaatta gagtctgacg gtgtaagaag tgtattaaaa 1680 
aaattaaagc cggaacactt tgaagatatt gttgctgtaa cttctttgta tagaccaggt 1740 
ccaatggaag aaattccaac ttacattaca agaagacatg atccaagcaa agttcaatat 1800 
ttacatccgc atttagaacc tatactaaaa aatacttacg gtgttattat ttatcaagag 1860 
caaattatgc aaatagcgag cacatttgca aacttcagtt atggtgaagc ggatatctta 1920 
agaagagcaa tgagtaaaaa aaatagagct gttcttgaaa gtgagcgCca acattttata 1980 
gaaggtgcaa agcaaaatgg ttatcacgaa gacattagta agcaaatatt tgatttgatt 2040 
ctgaaatttg ctgattatgg ttttcctaga gcacatgctg tcagctattc taaaattgca 2100 
tacattatga gctttttaaa agtccattat ccaaattatt tttacgcaaa tattttaagt 2160 
aatgttattg gaagtgagaa gaaaactgct caaatgatag aagaagcaaa aaaacaaggt 2220 
atcactatat tgccaccgaa cattaacgaa agtcattggt tttataaacc tccccaagaa 2280 
ggcatttatt tatcaattgg tacaattaaa ggtgttggtt atcaaagtgt gaaagtgatt 2340 
gttgatgaac gttatcagaa cggcaaattt aaagatttct tCgattttgc tagacgtata 2400 
ccgaagagag tcaaaacgag aaagttacct gaagcactga ttttagtggg agcgtttgat 24 60 
gcttttggta aaacacgttc aacgttgttg caagctattg atcaagtgtt ggatggcgat 2520 
ttaaacattg aacaagatgg ttttttattt gatattttaa cgccaaaaca gatgtatgaa 2580 
gataaagaag aattgcctga tgcacttatt agtcagtacg aaaaagaata tttaggattt 2640 
tatgtttcgc aacacccagt agataaaaag tttgttgcca aacaatattt aacgatattt 2700 
aaattgagta acgcgcagaa ttataaacct atatcagtac agtttgataa agttaaacaa 2760 
attcgaacta aaaatggtca aaatatggca ttcgtcacat taaatgatgg cactgaaact 2820 
ttagacggtg tgattttccc taatcagttt aaaaagtacg aagagttgtt atcacataat 2880 
gacttgttta tagttagcgg gaaatttgac catagaaagc aacaacgtca actaattata 2940 
aatgagattc agacattagc cacttttgaa gaacaaaaat tagcatttgc caaacaaatt 3000 
ataattagaa ataaatcaca aatagatatg tttgaagaga tgattaaagc tacgaaagag 3060 
aatgctaatg atgttgtgtt atccttttat gatgaaacga ttaaacaaat gactacttta 3120 
ggctatatta atcaaaaaga tagtatgttt aataatttta tacaatcctt taaccctagt 3180 
gatattaggc ttata 3195 



The S. aureus dnaE encoded protein, called oc-small, has an amino acid 
sequence corresponding to SEQ. ID. No. 2 as follows: 



Met Val Ala Tyr Leu Asn He His Thr Ala Tyr Asp Leu Leu Asn Ser 
15 10 15 

Ser Leu Lys He Glu Asp Ala Val Arg Leu Ala Val Ser Glu Asn Val 
20 25 30 

Asp Ala Leu Ala He Thr Asp Thr Asn Val Leu Tyr Gly Phe Pro Lys 
35 40 45 

Phe Tyr Asp Ala Cys He Ala Asn Asn He Lys Pro He Phe Gly Met 
50 55 60 

Thr He Tyr Val Thr Asn Gly Leu Asn Thr Val Glu Thr Val Val Leu 
65 70 75 80 

Ala Lys Asn Asn Asp Gly Leu Lys Asp Leu Tyr Gin Leu Ser Ser Glu 
85 90 95 

He Lys Met Asn Ala Leu Glu His Val Ser Phe Glu Leu Leu Lys Arg 
100 105 110 

Phe Ser Asn Asn Met He He He Phe Lys Lys Val Gly Asp Gin His 
115 120 125 

Arg Asp He Val Gin Val Phe Glu Thr His Asn Asp Thr Tyr Met Asp 
130 135 140 

His Leu Ser He Ser He Gin Gly Arg Lys His Val Trp He Gin Asn 
145 150 155 160 
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Val Cys Tyr Gin Thr Arg Gin Asp Ala Asp Thr He Ser Ala Leu Ala 
165 170 175 

Ala He Arg Asp Asn Thr Lys Leu Asp Leu He His Asp Gin Glu Asp 
180 185 190 

Phe Gly Ala His Phe Leu Thr Glu Lys Glu He Asn Gin Leu Asp He 
195 200 205 

Asn Gin Glu Tyr Leu Thr Gin Val Asp Val He Ala Gin Lys Cys Asp 
210 215 220 

' Ala Glu Leu Lys Tyr His i 
225 230 

Asn Asp Glu Ser Ala Lys Lys Tyr Leu Trp Arg Val Leu Val Thr Gin 
245 250 255 

Leu Lys Lys Leu Glu Leu Asn Tyr Asp Val Tyr Leu Glu Arg Leu Lys 
260 265 270 

Tyr Glu Tyr Lys Val He Thr Asn Met Gly Phe Glu Asp Tyr Phe Leu 
275 280 285 

He Val Ser Asp Leu He His Tyr Ala Lys Thr Asn Asp Val Met Val 
290 295 300 

Gly Pro Gly Arg Gly Ser Ser Ala Gly Ser Leu Val Ser Tyr Leu Leu 
305 310 315 320 

Gly He Thr Thr He Asp Pro He Lys Phe Asn Leu Leu Phe Glu Arg 
325 330 335 

Phe Leu Asn Pro Glu Arg Val Thr Met Pro Asp He Asp He Asp Phe 
340 345 350 

Glu Asp Thr Arg Arg Glu Arg Val He Gin Tyr Val Gin Glu Lys Tyr 
355 360 365 

Gly Glu Leu His Val Ser Gly He Val Thr Phe Gly His Leu Leu Ala 
370 375 380 

Arg Ala Val Ala Arg Asp Val Gly Arg He Met Gly Phe Asp Glu Val 
385 390 395 ' 400 

Thr Leu Asn Glu He Ser Ser Leu He Pro His Lys Leu Gly He Thr 

405 410 " 415 

Leu Asp Glu Ala Tyr Gin He Asp Asp Phe Lys Glu Phe Val His Arg 
420 425 430 

Asn His Arg His Glu Arg Trp Phe Ser He Cys Lys Lys Leu Glu Gly 
435 440 445 

Leu Pro Arg His Thr Ser Thr His Ala Ala Gly He He He Asn Asp 
450 455 460 

His Pro Leu Tyr Glu Tyr Ala Pro Leu Thr Lys Gly Asp Thr Gly Leu 
465 470 475 480 

Leu Thr Gin Trp Thr Met Thr Glu Ala Glu Arg He Gly Leu Leu Lys 
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lie Asp Phe Leu Gly Leu Arg Asn Leu Ser He He His Gin He Leu 
500 505 510 

Thr Gin Val Lys Lys Asp Leu Gly He Asn He Asp He Glu Lvs He 
515 520 , 525 

Pro Phe Asp Asp Gin Lys Val Phe Glu Leu Leu Ser Gin Gly Asp Thr 
530 535 540 

Thr Gly He Phe Gin Leu Glu Ser Asp Gly Val Arg Ser Val Leu Lys 
545 550 555 560 

Lys Leu Lys Pro Glu His Phe Glu Asp He Val Ala Val Thr Ser Leu 
565 570 575 

Tyr Arg Pro Gly Pro Met Glu Glu He Pro Thr Tyr He Thr Arg Arg 
580 585 590 

His Asp Pro Ser Lys Val Gin Tyr Leu His Pro His Leu Glu Pro He 
595 600 605 

Leu Lys Asn Thr Tyr Gly Val He He Tyr Gin Glu Gin He Met Gin 
610 615 620 

He Ala Ser Thr Phe Ala Asn Phe Ser Tyr Gly Glu Ala Asp He Leu 
625 630 635 640 

Arg Arg Ala Met Ser Lys Lys Asn Arg Ala Val Leu Glu Ser Glu Arg 
645 650 655 

Gin His Phe He Glu Gly Ala Lys Gin Asn Gly Tyr His Glu Asp He 
660 665 670 

Ser Lys Gin He Phe Asp Leu He Leu Lys Phe Ala Asp Tyr Gly Phe 
675 680 685 

Pro Arg Ala His Ala Val Ser Tyr Ser Lys He Ala Tyr He Met Ser 
690 695 700 

Phe Leu Lys Val His Tyr Pro Asn Tyr Phe Tyr Ala Asn He Leu Ser 
705 710 715 720 

Asn Val He Gly Ser Glu Lys Lys Thr Ala Gin Met He Glu Glu Ala 
725 730 735 

Lys Lys Gin Gly He Thr He Leu Pro Pro Asn He Asn Glu Ser His 
740 745 750 

Trp Phe Tyr Lys Pro Ser Gin Glu Gly He Tyr Leu Ser He Gly Thr 
755 760 765 

He Lys Gly Val Gly Tyr Gin Ser Val Lys Val He Val Asp Glu Arg 
770 775 780 

Tyr Gin Asn Gly Lys Phe Lys Asp Phe Phe Asp Phe Ala Arg Arg He 
785 790 795 800 

Pro Lys Arg Val Lys Thr Arg Lys Leu Leu Glu Ala Leu He Leu Val 
805 810 815 

Gly Ala Phe Asp Ala Phe Gly Lys Thr Arg Ser Thr Leu Leu Gin Ala 
620 825 ~ 830 
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Leu Pro Asp Ala Leu lie Ser Gin Tyr Glu Lys Glu Tyr Leu Gly Phe 

865 870 875 880 

Tyr Val Ser Gin His Pro Val Asp Lys Lys Phe Val Ala Lys Gin Tyr 

885 890 895 



Met Ala Phe Val Thr Leu Asn Asp Gly lie Glu Thr Leu Asp Gly Val 
930 935 940 



Gin Leu lie lie Asn Glu lie Gin Thr Leu Ala Thr Phe Glu Glu Gin 

980 985 990 

Lys Leu Ala Phe Ala Lys Gin He He He Arg Asn Lys Ser Gin He 

995 1000 1005 



Gly Tyr He Asn Gin Lys Asp Ser Met Phe Asn Asn Phe He Gin Ser 
1045 1050 1055 



The present invention also relates to the S. aureus dnaX gene. This 
S. aureus dnaX gene has a partial nucleotide sequence corresponding to SEQ. ID. 
No. 3 as follows: 



ttgaattatc aagccttata tcgtatgtac agaccccaaa gtttcgagga tgtcgtcgga 60 

caagaacatg tcacgaagac attgcgcaat gcgatttcga aagaaaaaca gtcgcatgca 120 

tatattttta gtggtccgag aggtacgggg aaaacgagta ttgccaaagt gtttgctaaa 180 

gcaatcaact gtttaaatag cactgatgga gaaccttgta atgaatgtca tatttgtaaa 240 

ggcattacgc aggggactaa ttcagatgtg atagaaattg atgctgctag taataatggc 300 

gttgatgaaa taagaaatat tagagacaaa gttaaatatg caccaagtga atcgaaatat 360 

aaagtttata ttatagatga ggtgcacatg ctaacaacag gtgcttttaa tgccctttta 420 

aagacgttag aagaacctcc agcacacgct atttttatat tggcaacgac agaaccacat 480 

aaaatccctc caacaatcat ttctagggca caacgttttg attttaaagc aattagccta 540 

gatcaaattg ttgaacgttt aaaatttgta gcagatgcac aacaaattga atgtgaagat 600 

gaagccttgg catttatcgc taaagcgtct gaagggggta tgcgtgatgc attaagtatt 660 
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atggatcagg ctattgcttt cggcgatggc acattgacat tacaagatgc cctaaatgtt 720 

acgggtagcg ttcatgatga agcgttggac cacttgtttg atgatattgt acaaggtgac 780 

gtacaagcat cttttaaaaa ataccatcag cttataacag aaggtaaaga agtgaatcgc 840 

ctaacaaatg atatgattta ttttgtcaga gatacgatta tqaataaaac atctgagaaa 900 

gataccgagt atcgagcact gatgaactta gaattagata tgttatatca aatgattgat 960 

cttattaatg atacattagt gtcgattcgt tttagtgtga atcaaaacgt tcattttgaa 1020 

gtattgttag taaaattagc tgagcagatt aagggtcaac cacaagtgat tgcgaatgta 1080 

gctgaaccag cacaaattgc ttcatcgcca aacacagatg tattgttgca acgtatggaa 1140 

cagttagagc aagaactaaa aacactaaaa gcacaaggag tgagtgtcgc tcctactcaa 1200 

aaatcttcga aaaagcctgc gagaggtata caaaaatcta aaaatgcatt ttcaatgcaa 1260 

caaattgcaa aagtgccaga taaagcgaat aaggcagata tcaaattgct gaaagatcat 1320 

tggcaagaag tgattgacca tgcccaaaac aatgataaaa aatcactcgt tagtttattg 1380 

caaaattcgg aacctgtggc ggcaagtgaa gatcacgtcc ttgtgaaatt tgaggaagag 1440 

atccattgtg aaatcgtcaa taaagacgac gagaaacgCa gtagtataga aagtgttgta 1500 

tgtaatatcg ttaataaaaa cgttaaagtt gttggtgtac catcagatca atggcaaaga 1560 

gttcgaacgg agtatttaca aaatcgtaaa aacgaaggcg atgatatgcc aaagcaacaa 1620 

gcacaacaaa cagatattgc tcaaaaagca aaagatcttt tcggtgaaga aactgtacat 1680 

gtgatagatg aagagtga 1698 



The S. aureus dnaX encoded protein (i.e., the tau subunit) has a partial" 
amino acid sequence corresponding to SEQ. ID. No. 4 as follows: 

Leu Asn Tyr Gin Ala Leu Tyr Arg Met Tyr Arg Pro Gin Ser Phe Glu 
15 10 15 

Asp Val Val Gly Gin Glu His Val Thr Lys Thr Leu Arg Asn Ala He 
20 25 30 

Ser Lys Glu Lys Gin Ser His Ala Tyr He Phe Ser Gly Pro Arg Gly 
35 40 45 

Thr Gly Lys Thr Ser He Ala Lys Val Phe Ala Lys Ala He Asn Cys 
50 55 60 

Leu Asn Ser Thr Asp Gly Glu Pro Cys Asn Glu Cys His He Cys Lys 
65 70 75 80 

Gly He Thr Gin Gly Thr Asn Ser Asp Val He Glu He Asp Ala Ala 
85 90 95 

Ser Asn Asn Gly Val Asp Glu He Arg Asn He Arg Asp Lys Val Lys 
100 105 110 



Ala He Ser Leu Asp Gin He Val Glu Arg Leu Lys Phe Val Ala Asp 
180 185 190 
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Ala Ser Glu Gly Gly Met Arg Asp Ala Leu Ser lie Met Asp Gin Ala 
210 215 220 

lie Ala Phe Gly Asp Gly Thr Leu Thr Leu Gin Asp Ala Leu Asn Val 
225 230 235 240 

Thr Gly Ser Val His Asp Glu Ala Leu Asp His Leu Phe Asp Asp lie 
245 250 255 

Val Gin Gly Asp Val Gin Ala Ser Phe Lys Lys Tyr His Gin Phe He 
260 265 270 

Thr Glu Gly Lys Glu Val Asn Arg Leu He Asn Asp Met He Tyr Phe 
275 280 285 



Arg Ala Leu Met Asn Leu Glu Leu Asp Met Leu Tyr Gin Met He Asp 

305 310 315 320 

Leu He Asn Asp Thr Leu Val Ser He Arg Phe Ser Val Asn Gin Asn 

325 330 335 



Gin Pro Gin Val lie Ala Asn Val Ala Glu Pro Ala Gin He Ala Ser 
355 360 365 



Glu Leu Lys Thr Leu Lys Ala Gin Gly Val Ser Val Ala Pro Thr Gin 
385 390 395 400 



Pro Val Ala Ala Ser Glu Asp His Val Leu Val Lys Phe Glu Glu Glu 
465 470 475 480 

He His Cys Glu He Val Asn Lys Asp Asp Glu Lys Arg Ser Ser He 
485 490 495 

Glu Ser Val Val Cys Asn He Val Asn Lys Asn Val Lys Val Val Gly 
500 505 510 

Val Pro Ser Asp Gin Trp Gin Arg Val Arg Thr Glu Tyr Leu Gin Asn 
515 520 525 

Arg Lys Asn Glu Gly Asp Asp Met Pro Lys Gin Gin Ala Gin Gin Thr 
530 535 540 
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Asp He Ala Gin Lys Ala Lys Asp Leu Phe Gly Glu Glu Thr Val His 
545 550 555 560 

Val He Asp Glu Glu Glx 
565 

The tau subunit of S. aureus functions as does both the tau subunit and the gamma 
subunit of E. coli. 

This invention also relates to the partial nucleotide sequence of the 
S. aureus dnaB gene. The partial nucleotide sequence of this dnaB gene corresponds 
to SEQ. ID. No. 5 as follows: 



atggatagaa 


tgtatgagca 


aaatcaaatg 


ccgcataaca 


atgaagctga 


acagtctgtc 


60 


ttaggttcaa 




tccagaattg 


attaatacta 


ctcaggaagt 


tttgcttcct 


120 


gagtcgtttt 


ataggggtgc 


ccatcaacat 


attttccgtg 


caatgatgca 


cttaaatgaa 


180 


gataataaag 


aaattgatgt 


tgtaacattg 


atggatcaat 


tatcgacgga 


aggtacgttg 


240 


aatgaagcgg 


gtggcccgca 


atatcttgca 


gagttatcta 


caaatgtacc 


aacgacgcga 


300 


aatgttcagt 


attatactga 


tatcgtttct 


aagcatgcat 


taaaacgtag 


attgattcaa 


360 


actgcagata 


gtattgccaa 


tgatggatat 


aatgatgaac 


ttgaactaga 




420 


agtgatgcag 


aacgtcgaat 


tttagagcta 




gtgaaagcga 




480 


gacattcgag 


acgtcttagg 


acaagtgtat 


gaaacagctg 


aagagcttga 


tcaaaatagt 


540 


ggtcaaacac 


caggtatacc 


tacaggatat 


cgagatttag 


accaaatgac 


agcagggttc 


600 


aaccgaaatg 


atttaattat 


ccttgcagcg 


cgtccatctg 


taggtaagac 


tgcgttcgca 


660 


cttaatattg 


cacaaaaagt 


tgcaacgcat 


gaagatatgt 


atacagttgg 


tattttctcg 


720 


ctagagatgg 


gtgctgatca 


gttagccaca 


cgtatgattt 


gtagttctgg 


aaatgttgac 


780 


tcaaaccgct 


taagaacggg 




gaggaagatt 


ggagtcgttt 


tactatagcg 


840 


gtaggtaaac 


tatcacgtac 


gaagattttt 


attgatgata 


caccgggtat 


tcgaattaat 


900 




ctaaatgtcg 


tcgattaaag 


caagaacatg 


gcttagacat 


gattgtgatt 


960 


gactacttac 




aggtagtggt 


tcacgtgcgt 


ccgataacag 


acaacaggaa 


1020 


gtttctgaaa 


tctctcgtac 


attaaaagca 


ttagcccgtg 


aattaaaatg 




1080 


gcattaagtc 




tggtgttgaa 


caacgacaag 


ataaacgtcc 


aatgatgagt 


1140 


gatattcgtg 


aatctggttc 


gattgagcaa 


gatgccgata 




cttataccgt 


1200 


gatgattact 


ataaccgtgg 


cggcgatgaa 


gatgatgacg 


atgatggtgg 


tttcgagcca 


1260 


caaacgaatg 


atgaaaacgg 


tgaaattgaa 


attatcattg 


ctaagcaacg 


taacggtcca 


1320 


acaggcacag 


ttaagttaca 


ttttatgaaa 


caatataata 


aatttaccga 


tatcgattat 


1380 


gcacatgcag 


atatgatg 










1398 



The amino acid sequence of S. aureus DnaB encoded by the dnaB gene 
corresponds to SEQ. ID. No. 6 as follows: 

Met Asp Arg Met Tyr Glu Gin Asn Gin Met Pro His Asn Asn Glu Ala 
1 5 10 15 

Glu Gin Ser Val Leu Gly Ser He He He Asp Pro Glu Leu He Asn 
20 25 30 

Thr Thr Gin Glu Val Leu Leu Pro Glu Ser Phe Tyr Arg Gly Ala His 
35 40 45 

Gin His He Phe Arg Ala Met Met His Leu Asn Glu Asp Asn Lys Glu 
50 55 60 

He Asp Val Val Thr Leu Met Asp Gin Leu Ser Thr Glu Gly Thr Leu 
65 70 75 80 
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Asn Glu Ala Gly Gly Pro Gin Tyr Leu Ala Glu Leu Ser Thr Asn Val 



Ala Leu Lys Arg Arg Leu He Gin Thr Ala Asp Ser He Ala Asn Asp 
115 120 125 

Gly Tyr Asn Asp Glu Leu Glu Leu Asp Ala He Leu Ser Asp Ala Glu 
130 135 140 

Arg Arg He Leu Glu Leu Ser Ser Ser Arg Glu Ser Asp Gly Phe Lys 
145 150 155 ' 160 

Asp He Arg Asp Val Leu Gly Gin Val Tyr Glu Thr Ala Glu Glu Leu 
165 170 175 

Asp Gin Asn Ser Gly Gin Thr Pro Gly He Pro Thr Gly Tyr Arg Asp 
180 185 190 

Leu Asp Gin Met Thr Ala Gly Phe Asn Arg Asn Asp Leu He He Leu 
195 200 205 

Ala Ala Arg Pro Ser Val Gly Lys Thr Ala Phe Ala Leu Asn He Ala 
210 215 220 

Gin Lys Val Ala Thr His Glu Asp Met Tyr Thr Val Gly He Phe Ser 
225 230 235 240 

Leu Glu Met Gly Ala Asp Gin Leu Ala Thr Arg Met He Cys Ser Ser 
245 250 255 

Gly Asn Val Asp Ser Asn Arg Leu Arg Thr Gly Thr Met Thr Glu Glu 
260 265 270 



He Phe He Asp Asp Thr Pro Gly He Arg He Asn Asp Leu Arg Ser 
290 295 300 

Lys Cys Arg Arg Leu Lys Gin Glu His Gly Leu Asp Met He Val He 
305 310 315 320 

Asp Tyr Leu Gin Leu He Gin Gly Ser Gly Ser Arg Ala Ser Asp Asn 
325 330 ~ 335 

Arg Gin Gin Glu Val Ser Glu He Ser Arg Thr Leu Lys Ala Leu Ala 
340 345 350 

Arg Glu Leu Lys Cys Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
355 360 365 

Val Glu Gin Arg Gin Asp Lys Arg Pro Met Met Ser Asp He Arg Glu 
370 375 380 

Ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Asn Arg Gly Gly Asp Glu Asp Asp Asp Asp Asp Gly 
405 410 415 
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The present invention also relates to the S. aureus polC gene (encoding 
Pol m-L or a-large). The partial nucleotide sequence of this polC gene corresponds 
to SEQ. ID. No. 7 as follows: 



atgacagagc aacaaaaatt taaagtgctt gctgatcaaa ttaaaatttc aaatcaatta 60 
gatgctgaaa ttttaaattc aggtgaactg acacgtatag atgtctctaa caaaaacaga 120 
acatgggaat ttcatattac actaccacaa ttcttagctc atgaagatta tttattattt 180 
ataaatgcaa tagagcaaga gtttaaagat atcgccaacg ttacatgtcg ttttacggta 240 
acaaatggca cgaatcaaga tgaacatgca attaaatact ttgggcactg tatcgaccaa 300 
acagctttat ctccaaaagt taaaggtcaa ttgaaacaga aaaagcttat tatgtctgga 360 
aaagtattaa aagtaatggt atcaaatgac attgaacgta atcattttga taaggcatgt 420 
aatggaagtc ttatcaaagc gtttagaaat tgtggttttg atatcgataa aatcatattc 480 
gaaacaaatg ataatgatca agaacaaaac ttagcttctt tagaagcaca tattcaagaa 540 
gaagacgaac aaagtgcacg attggcaaca gagaaacttg aaaaaatgaa agctgaaaaa 600 
gcgaaacaac aagataacaa cgaaagtgct gtcgataagt gtcaaattgg taagccgatt 660 
caaattgaaa atattaaacc aaccgaatct attattgagg aagagtttaa agttgcaata 720 
gagggtgtca tttttgatat aaacttaaaa gaacttaaaa gtggtcgcca tatcgtagaa 780 
attaaagtga ctgactatac ggactcttta gttttaaaaa tgtttactcg taaaaacaaa 840 
gatgatttag aacattttaa agcgctaagt gttggtaaat gggttagggc tcaaggtcgt 900 
attgaagaag atacatttat tagagattta gttatgatga tgtctgatat tgaagagatt 960 
aaaaaagcga caaaaaaaga taaggctgaa gaaaagcgtg tagaattcca cttgcatacC 1020 
gcaatgagcc aaatggatgg tatacccaat attggtgcgt atgttaaaca ggcagcagac 1080 
tggggacatc cagccattgc ggttacagac cataatgttg tgcaagcatt tccagatgct 1140 
cacgcagcag cggaaaaaca tggcattaaa atgatatacg gtatggaagg tatgttagtt 1200 
gatgatggtg ttccgattgc atacaaacca caagatgtcg tattaaaaga cgctacttat 1260 
gttgtgttcg acgttgagac aactggttta tcaaatcagt atgataaaat catcgagctt 1320 
gcagctgtga aagttcataa cggtgaaatc atcgataagt ttgaaaggtc tagtaatccg 1380 
catgaacgat tatcggaaac gattatcaat ttgacgcata ttactgatga tatgttagta 1440 
gatgcccctg agattgaaga agtacttaca gagtttaaag aatgggttgg cgatgcgata 1500 
ttcgtagcgc ataatgcttc gtttgatatg ggcttcatcg atacgggata tgaacgtctt 1560 
gggtttggac catcaacgaa tggtgttatc gatactttag aattatctcg tacgattaat 1620 
actgaatatg gtaaacatgg tttgaatttc ttggctaaaa aatatggcgt agaattaacg 1680 
caacatcacc gtgccatcta tgatacagaa gcaacagctc acattttcat aaaaatggtt 1740 
caacaaatga aagaattagg cgtattaaat cataacgaaa tcaacaaaaa actcagtaat 1800 
gaagatgcat ataaacgtgc aagacctagt catgtcacat taattgtaca aaaccaacaa 1860 
ggtcttaaaa atctatttaa aattgtaagt gcatcattgg tgaagtattt ctaccgtaca 1920 
cctcgaattc cacgttcatt gttagatgaa tatcgtgagg gattattggt aggtacagcg 1980 
tgtgatgaag gtgaattatt tacggcagtt atgcagaagg accagagtca agttgaaaaa 2040 
attgccaaat attatgattt tattgaaatt caaccaccgg cactttatca agatttaatt 2100 
gatagagagc ttattagaga tactgaaaca ttacatgaaa tttatcaacg tttaatacat 2160 
gcaggtgaca cagcgggtat acctgttatt gcgacaggaa atgcacacta tttgtttgaa 2220 
catgatggta tcgcacgtaa aattttaata gcatcacaac ccggcaatcc acttaatcgc 2280 
tcaactttac cggaagcaca ttttagaact acagatgaaa tgttaaacga gtttcatttt 2340 
ttaggtgaag aaaaagcgca tgaaattgtt gtgaaaaata caaacgaatt agcagatcga 2400 
attgaacgtg ttgttcctat taaagatgaa ttatacacac cgcgtatgga aggtgctaac 2460 
gaagaaatta gagaactaag ttatgcaaat gcgcgtaaac tgtatggtga agacctgcct 2520 
caaatcgtaa ttgatcgatt agaaaaagaa ttaaaaagta ttatcggtaa tggatttgcg 2580 
gtaatttact taatttcgca acgtttagtt aaaaaatcat tagatgatgg atacttagtt 2640 
ggttcccgtg gttcagtagg ttctagtttt gcagcgacaa tgactgagat tactgaagta 2700 
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aacccgttac cgccacacta tatttgtccg aactgtaaaa cgagtgaatt Cttcaatgat 2760 
ggttcagtag gatcaggatt tgatttacct gataagacgt gtgaaacttg tggagcgcca 2820 
cttattaaag aaggacaaga tattccgttt gaaacatttt taggatttaa gggagataaa 2880 
gttcctgata tcgacttaaa ctttagtggt gaatatcaac cgaatgccca taactacaca 2940 
aaagtattat ttggtgagga taaagtattc cgtgcaggta caattggtac tgttgctgaa 3000 
aagactgctt ttggttatgt taaaggttat ttgaatgatc aaggtatcca caaaagaggt 3060 
gctgaaatag atcgactcgt taaaggatgt acaggtgtta aacgtacaac tggacagcat 3120 
ccagggggta ttattgtagt acctgattac atggatattt atgatttcac gccgatacaa 3180 
tatcctgccg atgatcaaaa ttcagcatgg atgacgacac attttgattt ccattctatt 3240 
catgataatg tattaaaact tgatatactt ggacacgatg atccaacaat gattcgtatg 3300 
cttcaagatt tatcaggaat tgatccaaaa acaatacctg tagatgataa agaagttatg 3360 
cagatattta gtacacctga aagtttgggt gttactgaag atgaaatttt atgtaaaaca 3420 
ggtacatttg gggtaccaga attcggtaca ggattcgtgc gtcaaatgtt agaagataca 3480 
aagccaacaa cattttctga attagttcaa atctcaggat tatctcatgg tacagatgtg 3540 
tggttaggca atgctcaaga attaattaaa accggtatat gtgatttatc aagtgtaatt 3600 
ggttgtcgtg atgatatcat ggtttattta atgtatgctg gtttagaacc atcaatggct 3660 
tttaaaataa tggagtcagt acgtaaaggt aaaggtttaa ctgaagaaat gattgaaacg 3720 
atgaaagaaa atgaagtgcc agattggtat ttagattcat gtcttaaaat taagtacatg 3780 
ttccctaaag cccatgcagc agcatacgtt ttaatggcag tacgtatcgc atatttcaaa 3840 
gtacatcatc cactttatta ctatgcatct tactttacaa ttcgtgcgtc agactttgat 3900 
ttaatcacga tgattaaaga taaaacaagc attcgaaata ctgtaaaaga catgtattct 3960 
cgctatatgg atctaggtaa aaaagaaaaa gacgtattaa cagtcttgga aattatgaat 4020 
gaaatggcgc atcgaggtta tcgaatgcaa ccgattagtt tagaaaagag tcaggcgttc 4080 
gaatttatca ttgaaggcga tacacttatt ccgccgttca tatcagtgcc tgggcttggc 4140 
gaaaacgttg cgaaacgaat tgttgaagct cgtgacgatg gcccattttt atcaaaagaa 4200 
gatttaaaca aaaaagctgg attatctcag aaaattattg agtatttaga tgagttaggc 4260 
tcattaccga atttaccaga taaagctcaa ctttcgatat ttgatatg 4308 



The amino acid sequence of the S. aureus polC gene product, cc-large, 
corresponds to SEQ. ID. No. 8 as follows: 



Met Thr Glu Gin Gin Lys Phe Lys Val Leu Ala Asp Gin He Lys He 
15 10 15 

Ser Asn Gin Leu Asp Ala Glu lie Leu Asn Ser Gly Glu Leu Thr Arg 
20 25 30 

He Asp Val Ser Asn Lys Asn Arg Thr Trp Glu Phe His He Thr Leu 
35 40 45 

Pro Gin Phe Leu Ala His Glu Asp Tyr Leu Leu Phe He Asn Ala He 
50 55 60 

Glu Gin Glu Phe Lys Asp He Ala Asn Val Thr Cys Arg Phe Thr Val 
65 70 75 " 80 

Thr Asn Gly Thr Asn Gin Asp Glu His Ala He Lys Tyr Phe Gly His 
85 90 95 

Cys lie Asp Gin Thr Ala Leu Ser Pro Lys Val Lys Gly Gin Leu Lys 
100 105 110 

Gin Lys Lys Leu He Met Ser Gly Lys Val Leu Lys Val Met Val Ser 
115 120 125 

Asn Asp He Glu Arg Asn His Phe Asp Lys Ala Cys Asn Gly Ser Leu 
130 135 140 

He Lys Ala Phe Arg Asn Cys Gly Phe Asp He Asp Lys He He Phe 
145 150 155 160 
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Glu Thr Asn Asp Asn Asp Gin Glu Gin Asn Leu Ala Ser Leu Glu Ala 

165 170 175 

His lie Gin Glu Glu Asp Glu Gin Ser Ala Arg Leu Ala Thr Glu Lys 
180 165 190 

Leu Glu Lys Met Lys Ala Glu Lys Ala Lys Gin Gin Asp Asn Lys Gin 
195 200 205 

Ser Ala Val Asp Lys Cys Gin lie Gly Lys Pro lie Gin He Glu Asn 
210 215 220 

He Lys Pro He Glu Ser He He Glu Glu Glu Phe Lys Val Ala He 
225 230 235 240 

Glu Gly Val He Phe Asp He Asn Leu Lys Glu Leu Lys Ser Gly Arg 
245 250 255 

His He Val Glu He Lys Val Thr Asp Tyr Thr Asp Ser Leu Val Leu 
260 265 270 



Lys Lys Ala Thr Lys Lys Asp Lys Ala Glu Glu Lys Arg Val Glu Phe 
325 330 335 

His Leu His Thr Ala Met Ser Gin Met Asp Gly He Pro Asn He Gly 
340 345 350 

Ala Tyr Val Lys Gin Ala Ala Asp Trp Gly His "Pro Ala He Ala Val 

355 360 365 



Glu Lys His Gly He Lys Met He Tyr Gly Met Glu Gly Met Leu Val 
385 390 395 400 

Asp Asp Gly Val Pro He Ala Tyr Lys Pro Gin Asp Val Val Leu Lys 
405 410 415 

Asp Ala Thr Tyr Val Val Phe Asp Val Glu Thr Thr Gly Leu Ser Asn 
420 425 430 

Gin Tyr Asp Lys He He Glu Leu Ala Ala Val Lys Val His Asn Gly 
435 440 445 



Asp Ala Pro Glu He Glu Glu Val Leu Thr Glu Phe Lys Glu Trp Val 
485 490 495 
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Gly Asp Ala He Phe Val Ala His Asn Ala Ser Phe Asp Met Gly Phe 
500 505 510 

He Asp Thr Gly Tyr Glu Arg Leu Gly Phe Gly Pro Ser Thr Asn Gly 
515 520 525 

Val He Asp Thr Leu Glu Leu Ser Arg Thr He Asn Thr Glu Tyr Gly 
530 535 540 

Lys His Gly Leu Asn Phe Leu Ala Lys Lys Tyr Gly Val Glu Leu Thr 
545 550 555 560 

Gin His His Arg Ala He Tyr Asp Thr Glu Ala Thr Ala Tyr He Phe 
565 570 575 

lie Lys Met Val Gin Gin Met Lys Glu Leu Gly Val Leu Asn His Asn 
580 585 590 

Glu He Asn Lys Lys Leu Ser Asn Glu Asp Ala Tyr Lys Arg Ala Arg 
595 600 605 

Pro Ser His Val Thr Leu He Val Gin Asn Gin Gin Gly Leu Lys Asn 
610 615 620 

Leu Phe Lys He Val Ser Ala Ser Leu Val Lys Tyr Phe Tyr Arg Thr 
625 630 635 640 

Pro Arg He Pro Arg Ser Leu Leu Asp Glu Tyr Arg Glu Gly Leu Leu 
645 650 655 

Val Gly Thr Ala Cys Asp Glu Gly Glu Leu Phe Thr Ala Val Met Gin 
660 665 670 

Lys Asp Gin Ser Gin Val Glu Lys He Ala Lys Tyr Tyr Asp Phe He 
675 680 685 

Glu He Gin Pro Pro Ala Leu Tyr Gin Asp Leu He Asp Arg Glu Leu 
690 695 700 

He Arg Asp Thr Glu Thr Leu His Glu He Tyr Gin Arg Leu He His 
"705 710 715 720 

Ala Gly Asp Thr Ala Gly He Pro Val He Ala Thr Gly Asn Ala His 
725 730 735 

Tyr Leu Phe Glu His Asp Gly He Ala Arg Lys He Leu He Ala Ser 
740 745 750 

Gin Pro Gly Asn Pro Leu Asn Arg Ser Thr Leu Pro Glu Ala His Phe 
755 760 765 

Arg Thr Thr Asp Glu Met Leu Asn Glu Phe His Phe Leu Gly Glu Glu 
770 775 780 

Lys Ala His Glu He Val Val Lys Asn Thr Asn Glu Leu Ala Asp Arg 
785 790 795 800 

He Glu Arg Val Val Pro He Lys Asp Glu Leu Tyr Thr Pro Arg Met 
805 810 815 

Glu Gly Ala Asn Glu Glu He Arg Glu Leu Ser Tyr Ala Asn Ala Arg 
820 825 830 
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Lys Leu Tyr Gly Glu Asp Leu Pro Gin He Val He Asp Arg Leu Glu 

835 840 845 

Lys Glu Leu Lys Ser He He Gly Asn Gly Phe Ala Val He Tyr Leu 

850 855 860 

He Ser Gin Arg Leu Val Lys Lys Ser Leu Asp Asp Gly Tyr Leu Val 

865 870 875 880 

Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr Met Thr Glu 

885 890 895 

lie Thr Glu Val Asn Pro Leu Pro Pro His Tyr He Cys Pro Asn Cys 

900 905 ' 910 

Lys Thr Ser Glu Phe Phe Asn Asp Gly Ser Val Gly Ser Gly Phe Asp 

915 920 925 



Gly Gin Asp He Pro Phe Glu Lys Phe Leu Gly Phe Lys Gly Asp Lys 
945 950 955 960 

Val Pro Asp He Asp Leu Asn Phe Ser Gly Glu Tyr Gin Pro Asn Ala 
965 970 975 

His Asn Tyr Thr Lys Val Leu Phe Gly Glu Asp Lys Val Phe Arg Ala 
980 985 990 

Gly Thr He Gly Thr Val Ala Glu Lys Thr Ala Phe Gly Tyr Val Lys 
995 1000 1005 

Gly Tyr Leu Asn Asp Gin Gly lie His Lys Arg Gly Ala Glu He Asp 
1010 1015 1020 

Arg Leu Val Lys Gly Cys Thr Gly Val Lys Ala Thr Thr Gly Gin His 
1025 1030 1035 1040 

Pro Gly Gly He lie Val Val Pro Asp Tyr Met Asp He Tyr Asp Phe 
1045 1050 ~ 1055 

Thr Pro He Gin Tyr Pro Ala Asp Asp Gin Asn Ser Ala Trp Met Thr 
1060 1065 1070 

Thr His Phe Asp Phe His Ser lie His Asp Asn Val Leu Lys Leu Asp 
1075 1080 1085 

He Leu Gly His Asp Asp Pro Thr Met He Arg Met Leu Gin Asp Leu 
1090 1095 1100 

Ser Gly lie Asp Pro Lys Thr He Pro Val Asp Asp Lys Glu Val Met 
1105 1110 1115 1120 

Gin lie Phe Ser Thr Pro Glu Ser Leu Gly Val Thr Glu Asp Glu lie 
1125 1130 1135 

Leu Cys Lys Thr Gly Thr Phe Gly Val Pro Asn Ser Asp Arg lie Arg 
1140 1145 1150 

Arg Gin Met Leu Glu Asp Thr Lys Pro Thr Thr Phe Ser Glu Leu Val 
1155 1160 1165 
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Gin lie Ser Gly Leu Ser His Gly Thr Asp Val Trp Leu Gly Asn Ala 
1170 1175 1180 

Gin Glu Leu lie Lys Thr Gly lie Cys Asp Leu Ser Ser Val lie Gly 
1185 1190 1195 1200 

Cys Arg Asp Asp lie Met Val Tyr Leu Met Tyr Ala Gly Leu Glu Pro 
1205 1210 1215 

Ser Met Ala Phe Lys He Met Glu Ser Val Arg Lys Gly Lys Gly Leu 
1220 1225 1230 

Thr Glu Glu Met He Glu Thr Met Lys Glu Asn Glu Val Pro Asp Trp 
1235 1240 1245 

Tyr Leu Asp Ser Cys Leu Lys He Lys Tyr He Phe Pro Lys Ala His 
1250 1255 1260 

Ala Ala Ala Tyr Val Leu Met Ala Val Arg He Ala Tyr Phe Lys Val 
1265 1270 1275 1280 

His His Pro Leu Tyr Tyr Tyr Ala Ser Tyr Phe Thr lie Arg Ala Ser 
1285 1290 1295 

Asp Phe Asp Leu lie Thr Met He Lys Asp Lys Thr Ser lie Arg Asn 
1300 1305 1310 

Thr Val Lys Asp Met Tyr Ser Arg Tyr Met Asp Leu Gly Lys Lys Glu 
1315 1320 1325 

Lys Asp Val Leu Thr Val Leu Glu lie Met Asn Glu Met Ala His Arg 
1330 1335 1340 

Gly Tyr Arg Met Gin Pro lie Ser Leu Glu Lys Ser Gin Ala Phe Glu 
1345 1350 1355 1360 

Phe He lie Glu Gly Asp Thr Leu He Pro Pro Phe lie Ser Val Pro 
1365 1370 1375 

Gly Leu Gly Glu Asn Val Ala Lys Arg lie Val Glu Ala Arg Asp Asp 
1380 1385 1390 

Gly Pro Phe Leu Ser Lys Glu Asp Leu Asn Lys Lys Ala Gly Leu Tyr 

1395 ' 1400 1405 

Gin Lys lie lie Glu Tyr Leu Asp Glu Leu Gly Ser Leu Pro Asn Leu 
1410 1415 1420 

Pro Asp Lys Ala Gin Leu Ser He Phe Asp Met 
1425 1430 1435 



This invention also relates to the S. aureus dnaN gene encoding the 
beta subunit. The partial nucleotide sequence of this dnaN gene corresponds to SEQ. 
ID. No. 9 as follows: 



atgatggaat tcactattaa aagagattat tttattacac aattaaatga cacattaaaa 60 

gctatttcac caagaacaac attacctata ttaactggta tcaaaatcga tgcgaaagaa 120 

catgaagtta tattaactgg ttcagactct gaaatttcaa tagaaatcac tattcctaaa 180 

actgtagatg gcgaagatat tgtcaatatt tcagaaacag gctcagtagt acttcctgga 240 
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cgattctttg ttgatattat aaaaaaatta cctggtaaag atgttaaatc atctacaaat 300 

gaacaattcc agacactaat tacatcaggt cattctgaat ttaatttgag tggcttagat 360 

ccagatcaat atcctttatt acctcaagtt tctagagatg acgcaattca attgtcggta 420 

aaagtactta aaaacgtgat tgcacaaacg aattttgcag tgtccacctc agaaacacgc 480 

ccagtactaa ctggtgtgaa ctggcttata caagaaaatg aattaatatg cacagcgact 540 

gattcacacc gcttggctgt aagaaagttg cagttagaag atgtttctga aaacaaaaat 600 

gtcatcattc caggtaaggc tttagctgaa ttaaataaaa ttatgtctga caatgaagaa 660 

gacattgata tcttctttgc ttcaaaccaa gttttattta aagttggaaa tgtgaacttt 720 

atttctcgat tattagaagg acattatcct gatacaacac gtttattccc tgaaaactat 780 

gaaattaaat taagtataga caatggggag ttttatcatg cgattgatcg tgcctcttta 840 

ttagcacgtg aaggtggtaa caacgttatt aaattaagta caggtgacga cgttgttgaa 900 

ttatcttcta catcaccaga aattggtact gtaaaagaag aagttgatgc aaacgatgtt 960 

gaaggtggta gcctgaaaat ttcattcaac tctaaatata tgatggatgc tttaaaagca 1020 

atcgataatg atgaggttga agttgaattc ttcggtacaa tgaaaccatt tattctaaaa 1080 

ccaaaaggtg acgactcggt aacgcaatta attttaccaa tcagaactta ctaa 1134 



This amino acid sequence of S. aureus beta subunit is as follows (SEQ. 

ID. No. 10): 



Met Met Glu Phe Thr lie Lys Arg Asp Tyr Phe lie Thr Gin Leu Asn 
15 10 15 

Asp Thr Leu Lys Ala lie Ser Pro Arg Thr Thr Leu Pro lie Leu Thr 
20 25 30 

Gly He Lys He Asp Ala Lys Glu His Glu Val He Leu Thr Gly Ser 
35 40 45 

Asp Ser Glu He Ser He Glu He Thr He Pro Lys Thr Val Asp Gly 
5 0 5 5 60 

Glu Asp He Val Asn He Ser Glu Thr Gly Ser Val Val Leu Pro Gly 
65 70 75 80 

Arg Phe Phe Val Asp He He Lys Lys Leu Pro Gly Lys Asp Val Lys 



Leu Ser Thr Asn Glu Gin Phe Gin Thr Leu He Thr Ser Gly His Ser 
100 105 110 

Glu Phe Asn Leu Ser Gly Leu Asp Pro Asp Gin Tyr Pro Leu Leu Pro 
115 120 125 

Gin Val Ser Arg Asp Asp Ala He Gin Leu Ser Val Lys Val Leu Lys 
130 135 140 

Asn Val He Ala Gin Thr Asn Phe Ala Val Ser Thr Ser Glu Thr Arg 
145 150 155 160 

Pro Val Leu Thr Gly Val Asn Trp Leu He Gin Glu Asn Glu Leu He 
165 170 175 

Cys Thr Ala Thr Asp Ser His Arg Leu Ala Val Arg Lys Leu Gin Leu 
180 185 190 

Glu Asp Val Ser Glu Asn Lys Asn Val He He Pro Gly Lys Ala Leu 
195 200 205 

Ala Glu Leu Asn Lys He Met Ser Asp Asn Glu Glu Asp He Asp He 
210 215 220 
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Phe Phe Ala Ser Asn Gin Val Leu Phe Lys Val Gly Asn Val Asn Phe 

225 230 235 240 

He Ser Arg Leu Leu Glu Gly His Tyr Pro Asp Thr Thr Arg Leu Phe 
245 250 255 

Pro Glu Asn Tyr Glu He Lys Leu Ser He Asp Asn Gly Glu Phe Tyr 
260 2S5 270 

His Ala He Asp Arg Ala Ser Leu Leu Ala Arg Glu Gly Gly Asn Asn 
275 280 285 

Val He Lys Leu Ser Thr Gly Asp Asp Val Val Glu Leu Ser Ser Thr 
290 295 300 

Ser Pro Glu He Gly Thr Val Lys Glu Glu Val Asp Ala Asn Asp Val 
305 310 315 320 

Glu Gly Gly Ser Leu Lys He Ser Phe Asn Ser Lys Tyr Met Met Asp 
325 330 335 

Ala Leu Lys Ala He Asp Asn Asp Glu Val Glu Val Glu Phe Phe Gly 
340 345 350 

Thr Met Lys Pro Phe He Leu Lys Pro Lys Gly Asp Asp Ser Val Thr 
355 360 365 

Gin Leu He Leu Pro He Arg Thr Tyr 
370 375 

This invention also relates to the S. aureus holA gene encoding the 
delta subunit. The partial nucleotide sequence of this holA gene corresponds to SEQ. 
ID. No. 1 1 as follows: 



atggatgaac 


agcaacaatt 


gacgaatgca 


tatcattcaa 




gcatgcctat 


60 




gtgatgatgc 


acaaacgatg 


aaacaagttg 




tgcaaagctt 


120 


attttatgtc 


aaacagatag 




acaaaggt ta 




tcatccagac 


180 


tttatgtata 




tgagaatgca 


attaagaaag 




acaacttgtg 


240 


cgtcatatga 


atcaacttcc 


tatagaaagc 




tgtacatcat 


cgaagacttt 


300 


gaagactttg 


aaaagttaac 


tgttcaaggg 


gaaaacagta 


tcttgaaatt 


tcttgaagaa 


360 


ccaccggaca 


atacgattgc 


tattttattg 




ctgagcaaat 


tttagacaca 


420 


atccattcaa 


ggtgtcagca 


tgtatatttc 


aagcctattg 


ataaagaaaa 


gtttataaat 


480 


agattagttg 


aacaaaacat 


gtctaagcca 


gtagctgaaa 






540 


caaatagata 


atgcaatggc 


tttaaatgaa 


gaatttgatt 


tattagcatt 


aaggaaatca 


600 


gttatacgtt 


gggaattgtt 




aagccaatgg 


cacttatagg 


tattattgat 


660 


ttattgaaac 


aggctaaaaa 


taaaaaactg 


caatctttaa 


ctattgcagc 


tgtgaatggt 


720 


ttcttcgaag 


atatcataca 


tacaaaggta 


aatgtagagg 


ataaacaaat 


atatagtgat 


780 


ttaaaaaatg 


atattgatca 


atatgcgcaa 


aagttgtcgt 


ttaatcaatt 


aattttgatg 


840 


tttgatcaac 


tgacggaagc 


acataagaaa 




atgtaaatcc 


aacgcttgta 


900 


tttgaacaaa 


tcgtaattaa 


gggtgtgagt 








930 



The amino acid sequence of the delta subunit encoded by 5. aureus 
holA corresponds to SEQ. ID. No. 12 as follows: 



Met Asp Glu Gin Gin Gin Leu Thr Asn Ala Tyr His Ser Asn Lys Leu 
15 10 15 
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Ser His Ala Tyr Leu Phe Glu Gly Asp Asp Ala Gin Thr Met Lys Gin 
20 25 30 

Val Ala lie Asn Phe Ala Lys Leu lie Leu Cys Gin Thr Asp Ser Gin 
35 40 45 

Cys Glu Thr Lys Val Ser Thr Tyr Asn His Pro Asp Phe Met Tyr He 
50 55 60 

Ser Thr Thr Glu Asn Ala He Lys Lys Glu Gin Val Glu Gin Leu Val 
65 70 75 80 

Arg His Met Asn Gin Leu Pro lie Glu Ser Thr Asn Lys Val Tyr He 
85 90 95 



Arg Leu Val Glu Gin Asn Met Ser Lys Pro Val Ala Glu Met He Ser 
165 170 175 



Asp Leu Leu Ala Leu Arg Lys Ser Val lie Arg Trp Glu Leu Leu Leu 
195 200 205 



Ser Phe Asn Gin Leu He Leu Met Phe Asp Gin Leu Thr Glu Ala His 

275 280 285 



This invention also relates to the S. aureus holB gene encoding the 
delta prime subunit. The partial nucleotide sequence of this holB gene corresponds to 
SEQ. ED. No. 13 as follows: 
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atgagcgaca atattgtagc tatttatgga 
gcagaaatca tatcacaatt tttgaaaagt 
aatttatacg aaacagagat tgcaccaatt 
tcagataaaa aagcaatttt ggttaaaaat 
aaagatatgg ctcataatgt agaccaatta 
aatttgattg tctttgagat acatcaaaat 
actctaaaaa agcatgcaag gcttaaaaaa 
attcaaaaaa aagaacaagc gattgatttt 
ccaattaaac ttcttgcact tacatcaaat 
tcacaaaaag gttatagtgg tcaacaaatt 
gtgaaacttg cacttggtca agtgagacat 
gatgcatgtg cagaaacaga ttataaactt 
cttgaacttt ttattctttc actt 



gatgtgcctg aattggtcga aaaacaaagt 60 
gatagagatg actttaactt tgtgaaatat 120 
gtcgaagaaa cattaacatt gccttccctt 180 
gcatatatat ttacaggtga aaaagcgcca 240 
atagaattta ttgaaaaata tgatggcgaa 300 
aaacttgatg aaagaaaaaa gttaactaaa 360 
atagagcaga tgtcggagga gatcaagtgg 420 
gtaaaagatc ttataacaat gaaagaagaa 480 
tatagacttt tttatcaatg taaaattctt 540 
gcaaaaacaa taggtgttca tccatataga 600 
tatcaacttg atgaacttct taatattatt 660 
aaatcatcat atatggataa acaacttatt 720 
744 



The amino acid sequence of the delta prime subunit encoded by S. 
aureus holB corresponds to SEQ. ED. No. 14 as follows: 



Met Ser Asp Asn lie Val Ala lie Tyr Gly Asp Val Pro Glu Leu Val 
15 10 15 

Glu Lys Gin Ser Ala Glu He He Ser Gin Phe Leu Lys Ser Asp Arg 
20 25 30 

Asp Asp Phe Asn Phe Val Lys Tyr Asn Leu Tyr Glu Thr Glu He Ala 
35 40 45 

Pro He Val Glu Glu Thr Leu Thr Leu Pro Phe Phe Ser Asp Lys Lys 
50 55 60 

Ala He Leu Val Lys Asn Ala Tyr He Phe Thr Gly Glu Lys Ala Pro 
65 70 75 " 80 

Lys Asp Met Ala His Asn Val Asp Gin Leu He Glu Phe He Glu Lys 
85 90 95 

Tyr Asp Gly Glu Asn Leu He Val Phe Glu He Tyr Gin Asn Lys Leu 
100 105 110 

Asp Glu Arg Lys Lys Leu Thr Lys Thr Leu Lys Lys His Ala Arg Leu 
115 120 125 

Lys Lys He Glu Gin Met Ser Glu Glu He Lys Trp He Gin Lys Lys 
130 135 140 

Glu Gin Ala He Asp Phe Val Lys Asp Leu He Thr Met Lys Glu Glu 
145 150 155 160 

Pro He Lys Leu Leu Ala Leu Thr Ser Asn Tyr Arg Leu Phe Tyr Gin 
165 170 175 

Cys Lys He Leu Ser Gin Lys Gly Tyr Ser Gly Gin Gin He Ala Lys 
180 185 ' 190 

Thr He Gly Val His Pro Tyr Arg Val Lys Leu Ala Leu Gly Gin Val 
195 200 205 

Arg His Tyr Gin Leu Asp Glu Leu Leu Asn He He Asp Ala Cys Ala 
210 215 220 
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Glu Thr Asp Tyr Lys Leu Lys Ser Ser Tyr Met Asp Lys Gin Leu lie 
225 230 235 240 

Leu Glu Leu Phe lie Leu Ser Leu 
245 

This invention also relates to the 5". aureus dnaG gene encoding a 
primase. The partial nucleotide sequence of this dnaG gene corresponds to SEQ. ED. 
No. 15 as follows: 

atgataggtt tgtgtccttt tcatgatgaa aagacacctt catttacagt ttctgaagat 60 

aaacaaatct gtcattgttt tggttgtaaa aaaggtggca atgtttttca atttactcaa 120 

gaaattaaag acatatcatt tgttgaagcg gttaaagaat taggtgatag agttaatgtt 180 

gctgtagata ttgaggcaac acaatctaac tcaaatgttc aaattgcttc tgatgattta 240 

caaatgattg aaatgcatga gttaatacaa gaattttatt attacgcttt aacaaagaca 300 

gtcgaaggcg aacaagcatt aacatactta caagaacgtg gttttacaga tgcgcttatt 360 

aaagagcgag gcattggctt tgcacccgat agctcacatt tttgtcatga ttttcttcaa 420 

aaaaagggtt acgatattga attagcatat gaagccggat tattatcacg taacgaagaa 480 

aatttcagtt attacgatag atttcgaaat cgtattatgt ttcctttgaa aaatgcgcaa 540 

ggaagaateg ttggatattc aggtcgaaca tataccggtc aagaaccaaa acacctaaat 600 

agtcctgaaa cgcctatctt tcaaaaaaga aagttgttat ataacttaga taaagcacgt 660 

aaatcaatta gaaaattaga tgaaattgta ttactagaag gttttatgga tgttataaaa 720 

tctgatactg ctggcttgaa aaacgttgtt gcaacaatgg gtacacagtt gtcagatgaa 780 

catattacct ttatacgaaa gttaacatca aatataacat taatgtttga tggggatttt 840 

gcgggtagtg aagcaacact taaaacaggt caacatttgt tacagcaagg gctaaatgta 900 

tttgttatac aattgccatc tggcatggat ccggatgaat acattggtaa gtatggcaac 960 

gacgcattta ctacttttgt aaaaaatgac aaaaagtcat ttgcacatta taaagtaagt 1020 

atattaaaag atgaaattgc acataatgac ctttcatatg aacgttattt gaaagaactg 1080 

agtcatgaca tttcacttat gaagtcatca attctgcaac aaaaggctat aaatgatgtt 1140 

gcgccatttt tcaatgttag tcctgagcag ttagctaacg aaatacaatt caatcaagca 1200 

ccagccaatt attatccaga agatgagtat ggcggttatg atgagtatgg cggttatatt 1260 

gaacctgagc caattggtat ggcacaattt gacaatttga gccgtcgaga aaaagcggag 1320 

cgagcatttt taaaacattt aatgagagat aaagatacat ttttaaatta ttatgaaagt 1380 

gttgataagg ataacttcac aaatcagcat tttaaatatg tattcgaagt cttacatgat 1440 

ttttatgcgg aaaatgatca atataatatc agtgatgctg tgcagtatgt taattcaaat 1500 

gagttgagag aaacactaat tagcttagaa caatataatt tgaatggcga accacatgaa 1560 

aatgaaattg atgattatgt caatgttatt aatgaaaaag gacaagaaac aattgagtca 1620 

ttgaatcata aattaaggga agctacaagg attggcgatg tagaattaca aaaatactat 1680 

ttacagcaaa ttgttgctaa gaataaagaa cgcatgtag 1719 

The amino acid sequence of primase encoded by S. aureus dnaG 
corresponds to SEQ. ID. No. 16 as follows: 

Met He Gly Leu Cys Pro Phe His Asp Glu Lys Thr Pro Ser Phe Thr 
15 10 15 

Val Ser Glu Asp Lys Gin He Cys His Cys Phe Gly Cys Lys Lys Gly 
20 25 30 

Gly Asn Val Phe Gin Phe Thr Gin Glu He Lys Asp He Ser Phe Val 
35 40 45 

Glu Ala Val Lys Glu Leu Gly Asp Arg Val Asn Val Ala Val Asp He 
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Glu Ala Thr Gin Ser Asn Ser Asn Val Gin He Ala Ser Asp Asp Leu 
65 70 75 80 

Gin Met He Glu Met His Glu Leu He Gin Glu Phe Tyr Tyr Tyr Ala 
85 90 95 

Leu Thr Lys Thr Val Glu Gly Glu Gin Ala Leu Thr Tyr Leu Gin Glu 
100 105 110 

Arg Gly Phe Thr Asp Ala Leu He Lys Glu Arg Gly He Gly Phe Ala 
115 120 125 

Pro Asp Ser Ser His Phe Cys His Asp Phe Leu Gin Lys Lys Gly Tyr 
130 135 140 

Asp He Glu Leu Ala Tyr Glu Ala Gly Leu Leu Ser Arg Asn Glu Glu 
145 150 155 160 

Asn Phe Ser Tyr Tyr Asp Arg Phe Arg Asn Arg He Met Phe Pro Leu 
165 170 175 

Lys Asn Ala Gin Gly Arg He Val Gly Tyr Ser Gly Arg Thr Tyr Thr 
180 185 190 

Gly Gin Glu Pro Lys Tyr Leu Asn Ser Pro Glu Thr Pro lie Phe Gin 
195 200 205 



Lys Leu Asp Glu He Val Leu Leu Glu Gly Phe Met Asp Val He Lys 
225 230 235 240 

Ser Asp Thr Ala Gly Leu Lys Asn Val Val Ala Thr Met Gly Thr Gin 
245 250 255 

Leu Ser Asp Glu His He Thr Phe He Arg Lys Leu Thr Ser Asn He 
260 265 270 

Thr Leu Met Phe Asp Gly Asp Phe Ala Gly Ser Glu Ala Thr Leu Lys 
275 280 285 

Thr Gly Gin His Leu Leu Gin Gin Gly Leu Asn Val Phe Val lie Gin 
290 295 300 

Leu Pro Ser Gly Met Asp Pro Asp Glu Tyr He Gly Lys Tyr Gly Asn 
305 310 315 320 

Asp Ala Phe Thr Thr Phe Val Lys Asn Asp Lys Lys Ser Phe Ala His 
325 330 335 

Tyr Lys Val Ser lie Leu Lys Asp Glu lie Ala His Asn Asp Leu Ser 
340 345 350 

Tyr Glu Arg Tyr Leu Lys Glu Leu Ser His Asp He Ser Leu Met Lys 
355 360 365 



Asn Val Ser Pro Glu Gin Leu Ala Asn Glu lie Gin Phe Asn Gin Ala 
385 390 395 400 
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Pro Ala Asr. Tyr Tyr Pro Glu Asp Glu Tyr Gly Gly Tyr Asp Glu Tyr 
405 410 415 

Gly Gly Tyr He Glu Pro Glu Pro He Gly Met Ala Gin Phe Asp Asn 
420 425 430 

Leu Ser Arg Arg Glu Lys Ala Glu Arg Ala Phe Leu Lys His Leu Met 
435 440 445 

Arg Asp Lys Asp Thr Phe Leu Asn Tyr Tyr Glu Ser Val Asp Lys Asp 
450 455 460 

Asn Phe Thr Asn Gin His Phe Lys Tyr Val Phe Glu Val Leu His Asp 
465 470 475 480 

Phe Tyr Ala Glu Asn Asp Gin Tyr Asn He Ser Asp Ala Val Gin Tyr 
485 490 " 495 

Val Asn Ser Asn Glu Leu Arg Glu Thr Leu He Ser Leu Glu Gin Tyr 
500 505 510 

Asn Leu Asn Gly Glu Pro Tyr Glu Asn Glu He Asp Asp Tyr Val Asn 
515 520 525 

Val He Asn Glu Lys Gly Gin Glu Thr He Glu Ser Leu Asn His Lys 
530 535 540 

Leu Arg Glu Ala Thr Arg He Gly Asp Val Glu Leu Gin Lys Tyr Tyr 
545 550 555 560 

Leu Gin Gin He Val Ala Lys Asn Lys Glu Arg Met 

565 570 



This invention also relates to the polC gene of Streptococcus pyogenes 
encoding the a-large subunit. The partial nucleotide sequence of polC (a-large) 
corresponds to SEQ. ID. No. 17 as follows: 



atgtcagatt tattcgctaa attgatggac cagatagaaa tgccacttga catgagacgt 60 
tcaagtgcct tttcatctgc tgatattatc gaggtaaagg tacattcggt gtcacgcttg 120 
tgggaatttc attttgcctt tgcagcggtt ttaccgattg caacttatcg tgaattgcat 180 
gatcgtttga taagaacttt tgaggcggct gacattaagg taacctttga catccaagct 240 
gctcaggtgg attattcaga tgatctgctt caagcttatt accaagaagc ttttgagcat 300 
gcaccgtgta atagtgctag ttttaaatct tctttctcaa agctcaaagt gacttatgag 360 
gatgacaaac tcattattgc agcgccaggt tttgtgaata acgatcattt tagaaacaat 420 
catctgccta atctggtcaa gcaattagaa gcctttggct ttggcatctt gaccatagat 480 
atggtgtcag atcaggaaat gactgagcat ttgaccaaga attttgtttc cagtcgtcag 540 
gctcttgtga aaaaggctgt gcaggataat ttggaagccc aaaaatctct tgaagccatg 600 
atgccaccag ttgaggaagc cacacctgct cctaagtttg actacaagga acgagcagct 660 
aagcgtcagg cagggtttga aaaagcaacc atcacaccaa tgattgagat tgagaccgaa 720 
gaaaaccgga ttgtctttga gggtatggtt tttgacgtgg agcgtaaaac gactaggaca 780 
ggtcgccata tcatcaactt taaaatgaca gactatacct cctcgtttgc tctccaaaaa 840 
tgggctaaag acgatgagga gctccgtaaa tttgatatga ttgctaaggg agcttggtta 900 
cgggtacaag ggaatattga gaccaatcct tttacgaaga gtctcaccat gaatgtccag 960 
caggtcaaag aaattgtccg tcatgagcgc aaagacctga tgccagaagg gcaaaagcgg 1020 
gtcgaacttc atgcccacac caatatgtct accatggatg ccttaccgac agtagaaagc 1080 
ttgattgata cggcagccaa gtggggacac aaggcgattg ctatcaccga ccatgctaat 1140 
gtgcaaagtt ttcctcatgg ctaccatagg gctcgcaaag ctgggattaa ggctattttt 1200 
ggcctagaag ccaatattgt tgaggacaag gtgcctattt cttatgaacc tgttgatatg 1260 
gatttgcacg aagccaccta tgtggtcttt gacgtggaaa ccacaggtct atctgctatg 1320 
aataatgacc tgattcagat tgcggcttcc aaaatgttta aaggaaatat tgtagagcag 1380 
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tttgatgaat tcattgatcc tgggcatcct ctctcagcct ttaccaccga attgacagga 1440 
attaccgata agcatttgca gggcgccaag ccattggtta ctgtcctaaa agcttttcag 1500 
gacttttgca aagatagtat cttggttgcc cacaacgcca gctttgacgt gggctttatg 1560 
aacgccaact atgaacgcca cgacttgccc aaaatcacac agcctgtgat tgatacctta 1620 
gaatttgcta gaaacttgta tcctgagtac aagcgtcacg gtttgggacc gctcaccaag 1680 
cgtttccaag tgagtctaga ccaccatcat atggccaatt acgacgcgga agccacagga 1740 
cgtcttttgc ttatttttct aaaagatgcc agagaaaagc atggcatcaa aaatcttttg 1800 
caactcaata cagatttggt ggctgaggat tcttacaaaa aagcgcggat taagcatgcg 1860 
actatctatg tgcaaaatca ggttggtctt aaaaatatgt ttaagttggt cagcctttcc 1920 
aatatcaaat attttgaagg ggtgccgcgt attccaagaa ccgtcttaga tgctcacaga 1980 
gagggtttgt tacttggtac agcttgttct gacggcgagg tttttgatgc cgttctgact 2040 
aaaggaattg atgcagcggt tgatttggct aggtattatg attttatcga aatcatgcca 2100 
ccagccattt accagccatt ggttgtccgt gaattaatca aagatcaagc aggtattgag 2160 
caggtgattc gtgacctcat tgaagtaggg aaacgagcta agaaacctgt gcttgccact 2220 
gggaatgtgc attatctaga gcctgaagaa gagatttacc gtgaaattat tgtgcgtagt 2280 
cttggtcagg gtgccatgat taatagaaca atcggccgtg gggaaggggc acagcctgct 2340 
cctctaccta aagcgcactt tagaacaacc aatgaaatgc tggatgagtt tgcctttctt 2400 
ggaaaagacc tcgcttatca agtagttgtg caaaatactc aggattttgc ggaccgtatt 2460 
gaggaagtgg aagtggttaa gggcgatctt tacaccccgC atattgataa ggccgaagag 2520 
acggttgccg aattaaccta tcaaaaagcc tttgaaattt atggtaatcc tctcccagat 2580 
attattgatt tacgcattga aaaagagtta acctctatct tggggaacgg ttttgctgtg 2640 
atttatctcg cttcccaaat gcttgttaac cggtcaaatg agcgaggcta cctagttggt 2700 
tctaggggat ctgtagggtc tagctttgtc gccaccatga ttgggattac tgaggttaat 2760 
cctatgccgc ctcactacgt ttgcccgtcc tgccaacatt ctgaatttat cacagatggg 2820 
tcagttggat ctggctatga tttgcctaat aaaccctgtc cgaaatgtgg caccccttat 2880 
caaaaagatg ggcaagacat tccctttgag acctttcttg ggtttgatgg ggataaggtg 2940 
cccgatattg atttgaactt ctctggtgat gaccagccca gtgcccattt ggatgtccga 3000 
gatatttttg gtgacgaata cgcctttcgt gctggaacag ttggtaccgt agcagaaaaa 3060 
acagcttatg gatttgtcaa aggctatgaa cgcgactatg gcaagttcta tcgtgatgct 3120 
gaggtggatc gtctagcagc aggtgctgct ggtgtgaaac gaacgactgg gcagcaccct 3180 
ggggggattg ttgttattcc taattacatg gatgtttatg attttacccc cgtgcaatat 3240 
ccagccgatg atgtaacggc ttcttggcag acaactcact ttaacttcca tgatattgat 3300 
gaaaacgtct tgaaacttga tatcctaggg catgatgatc cgaccatgat tcgtaaactt 3360 
caggatttat cgggcattga tcctattact attcctgctg atgatccggg agttatggct 3420 
ctcttttctg ggacagaggt tttgggcgtt accccggaac aaattgggac accgactggt 3480 
atgctaggca ttccagaatt tggaaccaac tttgttcgcg gcatggttaa tgagacgcat 3540 
ccgaccactt ttgcggagct tttgcagttg tctggactat ctcatggaac cgatgtttgg 3600 
cttggtaatg cacaagattt gattaaagaa ggcattgcaa ccctaaaaac cgttatcggt 3660 
tgtcgtgacg acatcatggt ttacctcatg cacgcaggct tagaaccaaa aatggccttt 3720 
accattatgg agcgtgtgcg taagggatta Cggctaaaaa tttctgagga agaacgtaat 3780 
ggctatattg atgccatgcg agaaaacaat gtgcccgact ggtacattga atcgtgtgga 3840 
aaaatcaagt acatgttccc taaagcccat gcggcagctt atgttttgat ggcccttcgg 3900 
gtggcttatt tcaaggtgca ccaccccatt atgtattatt gtgcttattt ctctattcgt 3960 
gcgaaggctt ttgaattaaa aaccatgagt ggtggtttag atgctgttaa agcaagaatg 4020 
gaagatatta ctataaaacg taaaaataat gaagccacca atgtggaaaa tgacctcttt 4080 
acaaccttgg agattgtcaa cgaaatgtta gaacgcggct ttaagtttgg caaattagac 4140 
ctttacaaaa gtgatgctat agaattccaa atcaaaggag atacccttat ccctccattt 4200 
atagcgctag aaggtctggg tgaaaacgtg gccaagcaaa tcgttaaagc tcgtcaagaa 4260 
ggcgaattcc tctctaaaat ggaattgcgt aaacgaggcg gggcatcgtc aacgctcgtt 4320 
gagaaaatgg atgagatggg tattttagga aatatgccag aagataatca attaagtctt 4380 
tttgatgact ttttc 4395 



The encoded a-large subunit has an amino acid sequence corresponding to SEQ. ID. 
No. 18 as follows: 

Met Ser Asp Leu Phe Ala Lys Leu Met Asp Gin He Glu Met Pro Leu 
1 5 10 15 

Asp Met Arg Arg Ser Ser Ala Phe Ser Ser Ala Asp He He Glu Val 
20 25 30 
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Lys Val His Ser Val Ser Arg Leu Trp Glu Phe His Phe Ala Phe Ala 
35 40 45 

Ala Val Leu Pro He Ala Thr Tyr Arg Glu Leu His Asp Arg Leu He 
50 55 60 

Arg Thr Phe Glu Ala Ala Asp He Lys Val Thr Phe Asp He Gin Ala 
65 70 75 80 

Ala Gin Val Asp Tyr Ser Asp Asp Leu Leu Gin Ala Tyr Tyr Gin Glu 
85 90 " ' 95 

Ala Phe Glu His Ala Pro Cys Asn Ser Ala Ser Phe Lys Ser Ser Phe 
100 105 110 

Ser Lys Leu Lys Val Thr Tyr Glu Asp Asp Lys Leu He He Ala Ala 
115 120 125 

Pro Gly Phe Val Asn Asn Asp His Phe Arg Asn Asn His Leu Pro Asn 
130 135 140 

Leu Val Lys Gin Leu Glu Ala Phe Gly Phe Gly He Leu Thr He Asp 
145 150 155 160 

Met Val Ser Asp Gin Glu Met Thr Glu His Leu Thr Lys Asn Phe Val 
165 170 ' 175 

Ser Ser Arg Gin Ala Leu Val Lys Lys Ala Val Gin Asp Asn Leu Glu 
180 185 190 

Ala Gin Lys Ser Leu Glu Ala Met Met Pro Pro Val Glu Glu Ala Thr 
195 200 205 

Pro Ala Pro Lys Phe Asp Tyr Lys Glu Arg Ala Ala Lys Arg Gin Ala 
210 215 220 

Gly Phe Glu Lys Ala Thr He Thr Pro Met He Glu He Glu Thr Glu 
225 230 235 240 

Glu Asn Arg He Val Phe Glu Gly Met Val Phe Asp Val Glu Arg Lys 
245 250 ~ 255 

Thr Thr Arg Thr Gly Arg His He He Asn Phe Lys Met Thr Asp Tyr 
260 265 270 

Thr Ser Ser Phe Ala Leu Gin Lys Trp Ala Lys Asp Asp Glu Glu Leu 
275 280 285 

Arg Lys Phe Asp Met He Ala Lys Gly Ala Trp Leu Arg Val Gin Gly 
2 90 2 95 3 00 

Asn He Glu Thr Asn Pro Phe Thr Lys Ser Leu Thr Met Asn Val Gin 
305 310 315 320 

Gin Val Lys Glu lie Val Arg His Glu Arg Lys Asp Leu Met Pro Glu 
325 330 335 

Gly Gin Lys Arg Val Glu Leu His Ala His Thr Asn Met Ser Thr Met 
340 345 350 

Asp Ala Leu Pro Thr Val Glu Ser Leu He Asp Thr Ala Ala Lys Trp 
355 360 365 
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Gly His Lys Ala lie Ala lie Thr Asp His Ala Asn Val Gin Ser Phe 
370 375 380 

Pro His Gly Tyr His Arg Ala Arg Lys Ala Gly lie Lys Ala He Phe 
38d 390 395 400 

Gly Leu Glu Ala Asn He Val Glu Asp Lys Val Pro He Ser Tyr Glu 
405 410 415 

Pro Val Asp Met Asp Leu His Glu Ala Thr Tyr Val Val Phe Asp Val 
420 425 430 

Glu Thr Thr Gly Leu Ser Ala Met Asn Asn Asp Leu He Gin He Ala 
435 440 445 

Ala Ser Lys Met Phe Lys Gly Asn He Val Glu Gin Phe Asp Glu Phe 
450 455 460 

He Asp Pro Gly His Pro Leu Ser Ala Phe Thr Thr Glu Leu Thr Gly 
465 470 475 480 

He Thr Asp Lys His Leu Gin Gly Ala Lys Pro Leu Val Thr Val Leu 
485 490 495 

Lys Ala Phe Gin Asp Phe Cys Lys Asp Ser He Leu Val Ala His Asn 
500 505 510 

Ala Ser Phe Asp Val Gly Phe Met Asn Ala Asn Tyr Glu Arg His Asp 
515 520 525 

Leu Pro Lys He Thr Gin Pro Val He Asp Thr Leu Glu Phe Ala Arg 
530 535 540 

Asn Leu Tyr Pro Glu Tyr Lys Arg His Gly Leu Gly Pro Leu Thr Lys 
545 550 555 560 

Arg Phe Gin Val Ser Leu Asp His His His Met Ala Asn Tyr Asp Ala 
565 570 575 

Glu Ala Thr Gly Arg Leu Leu Phe He Phe Leu Lys Asp Ala Arg Glu 
580 585 590 

Lys His Gly He Lys Asn Leu Leu Gin Leu Asn Thr Asp Leu Val Ala 
595 600 605 

Glu Asp Ser Tyr Lys Lys Ala Arg He Lys His Ala Thr He Tyr Val 
610 615 620 

Gin Asn Gin Val Gly Leu Lys Asn Met Phe Lys Leu Val Ser Leu Ser 
625 630 635 640 

Asn He Lys Tyr Phe Glu Gly Val Pro Arg He Pro Arg Thr Val Leu 
645 650 ~ 655 

Asp Ala His Arg Glu Gly Leu Leu Leu Gly Thr Ala Cys Ser Asp Gly 
660 665 670 

Glu Val Phe Asp Ala Val Leu Thr Lys Gly He Asp Ala Ala Val Asp 
675 680 685 

Leu Ala Arg Tyr Tyr Asp Phe He Glu He Met Pro Pro Ala He Tyr 
690 695 700 
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Gin Pro Leu Val Val Arg Glu Leu lie Lys Asp Gin Ala Gly He Glu 

705 710 715 . 720 

Gin Val He Arg Asp Leu He Glu Val Gly Lys Arg Ala Lys Lys Pro 

725 730 735 

Val Leu Ala Thr Gly Asn Val His Tyr Leu Glu Pro Glu Glu Glu He 

740 745 750 



Tyr Arg Glu He He Val Arg Ser Leu Gly Gin Gly Ala Met He Asn 
755 760 765 

Arg Thr He Gly Arg Gly Glu Gly Ala Gin Pro Ala Pro Leu Pro Lys 
770 775 780 

Ala His Phe Arg Thr Thr Asn Glu Met Leu Asp Glu Phe Ala Phe Leu 
785 790 795 800 

Gly Lys Asp Leu Ala Tyr Gin Val Val Val Gin Asn Thr Gin Asp Phe 
805 810 815 

Ala Asp Arg He Glu Glu Val Glu Val Val Lys Gly Asp Leu Tyr Thr 
820 825 830 

Pro Tyr He Asp Lys Ala Glu Glu Thr Val Ala Glu Leu Thr Tyr Gin 
835 840 845 

Lys Ala Phe Glu He Tyr Gly Asn Pro Leu Pro Asp He He Asp Leu 
850 855 860 

Arg He Glu Lys Glu Leu Thr Ser He Leu Gly Asn Gly Phe Ala Val 
865 870 875 880 

He Tyr Leu Ala Ser Gin Met Leu Val Asn Arg Ser Asn Glu Arg Gly 
885 890 895 

Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr 
900 905 910 

Met He Gly He Thr Glu Val Asn Pro Met Pro Pro His Tyr Val Cys 
915 920 925 



Pro Ser Cys Gin His Ser Glu Phe 
930 935 

Gly Tyr Asp Leu Pro Asn Lys Pro 
945 950 

Gin Lys Asp Gly Gin Asp He Pro 
965 

Gly Asp Lys Val Pro Asp He Asp 
980 

Pro Ser Ala His Leu Asp Val Arg 

995 1000 

Phe Arg Ala Gly Thr Val Gly Thr 
1010 1015 

Phe Val Lys Gly Tyr Glu Arg Asp 
1025 1030 



He Thr Asp Gly Ser Val Gly Ser 
94 0 

Cys Pro Lys Cys Gly Thr Pro Tyr 
955 960 

Phe Glu Thr Phe Leu Gly Phe Asp 
970 975 

Leu Asn Phe Ser Gly Asp Asp Gin 
985 990 

Asp He Phe Gly Asp Glu Tyr Ala 
1005 

Val Ala Glu Lys Thr Ala Tyr Gly 
1020 

Tyr Gly Lys Phe Tyr Arg 'Asp Ala 
1035 1040 
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Glu Val Asp Arg Leu Ala Ala Gly Ala Ala Gly Val Lys Arg Thr Thr 
1045 ' 1050 1055 

Gly Gin His Pro Gly Gly He Val Val He Pro Asn Tyr Met Asp Val 
1060 1065 1070 

Tyr Asp Phe Thr Pro Val Gin Tyr Pro Ala Asp Asp Val Thr Ala Ser 
1075 1080 1085 

Trp Gin Thr Thr His Phe Asn Phe His Asp He Asp Glu Asn Val Leu 
1090 1095 1100 

Lys Leu Asp He Leu Gly His Asp Asp Pro Thr Met He Arg Lys Leu 
1105 1110 1115 1120 

Gin Asp Leu Ser Gly lie Asp Pro He Thr He Pro Ala Asp Asp Pro 
1125 1130 1135 

Gly Val Met Ala Leu Phe Ser Gly Thr Glu Val Leu Gly Val Thr Pro 
1140 1145 1150 

Glu Gin He Gly Thr Pro Thr Gly Met Leu Gly He Pro Glu Phe Gly 
1155 1160 1165 

Thr Asn Phe Val Arg Gly Met Val Asn Glu Thr His Pro Thr Thr Phe 
1170 1175 1180 

Ala Glu Leu Leu Gin Leu Ser Gly Leu Ser His Gly Thr Asp Val Trp 
1185 1190 1195 ' 1200 

Leu Gly Asn Ala Gin Asp Leu He Lys Glu Gly He Ala Thr Leu Lys 
1205 1210 1215 

Thr Val He Gly Cys Arg Asp Asp He Met Val Tyr Leu Met His Ala 
1220 1225 1230 

Gly Leu Glu Pro Lys Met Ala Phe Thr He Met Glu Arg Val Arg Lys 
1235 1240 1245 

Gly Leu Trp Leu Lys He Ser Glu Glu Glu Arg Asn Gly Tyr He Asp 
1250 1255 1260 

Ala Met Arg Glu Asn Asn Val Pro Asp Trp Tyr He Glu Ser Cys Gly 
1265 1270 ~ 1275 1280 

Lys He Lys Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu 
1285 1290 1295 

Met Ala Leu Arg Val Ala Tyr Phe Lys Val His His Pro He Met Tyr 
1300 1305 1310 

Tyr Cys Ala Tyr Phe Ser He Arg Ala Lys Ala Phe Glu Leu Lys Thr 
1315 1320 1325 

Met Ser Gly Gly Leu Asp Ala Val Lys Ala Arg Met Glu Asp He Thr 
1330 1335 1340 

He Lys Arg Lys Asn Asn Glu Ala Thr Asn Val Glu Asn Asp Leu Phe 
1345 1350 1355 1360 

Thr Thr Leu Glu He Val Asn Glu Met Leu Glu Arg Gly Phe Lys Phe 
1365 1370 1375 
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Gly Lys Leu Asp Leu Tyr Lys Ser Asp Ala He Glu Phe Gin He Lys 
-380 1385 1390 

Gly Asp Thr Leu He Pro Pro Phe He Ala Leu Glu Gly Leu Glv Glu 
1395 1400 1405 

Asn Val Ala Lys Gin He Val Lys Ala Arg Gin Glu Gly Glu Phe Leu 
1410 1415 1420 

Ser Lys Met Glu Leu Arg Lys Arg Gly Gly Ala Ser Ser Thr Leu Val 
1425 1430 1435 1440 

Glu Lys Met Asp Glu Met Gly He Leu Gly Asn Met Pro Glu Asp Asn 
1445 1450 1455 

Gin Leu Ser Leu Phe Asp Asp Phe Phe 
1460 14G5 



The present invention also relates to the dnaE gene of Streptococcus 
pyogenes encoding the a-small subunit. The partial nucleotide sequence of the dnaE 
gene corresponds to SEQ. YD. No. 19 as follows: 



atgtttgctc aacttgatac taaaactgta 
aatcattatt ttgaacgagc aaagcaattt 
gataatcttt atggtgctta ccattttatt 
gttttaggtt tggaaataga gattctctat 
gcccagaata cacaaggcta tcatcagctt 
aagcttcata tggattactt ctgccaacat 
aagggttgga gcgatacatt agtggtccct 
actgatttat ctcatatgga ttctaagagg 
tttgcgcaag atgatatgga aaccctgcac 
ctggcagaga cccctgtggt agaaagtgat 
gccttctatc aaacacactg ccctcaagct 
atctattatg atttcgatac aaatttaaaa 
aagcaagaat tgcaagactt gactgaggct 
ccttatcaat cgcgcttact acatgaattg 
tattttttga ttgtgtggga tttacttcgc 
atgggacgtg gctcggcggc aggtagtcta 
gatccagttc aacatgattt gctatttgag 
cctgatattg atatcgatct tccagatatt 
aatcgttatg gtagcgacca ttcggcgcaa 
attcgtgatg ttttcaaacg gttcggggtt 
aaaattggtt ttaaagatag cttggctact 
gttattaata gtagaactga atttcaaaag 
aatccaagac aaacgtccat tcacgcagct 
aatcatattc ctctaaaatc gggcgatgac 
gtcgaagcta atggcctgtt aaaaatggat 
caaaaaatgc aagagaaggt tgctaaagac 
gatttagaag acccgcaaac gttggcactt 
caatttgaac aaaatggtgc tattaatctt 
gaaattgttg ccactaccag tctaaataga 
attaaacgaa gagaaggaca agaaaaaatt 
ttagagccaa cttacggtat tatgctttat 
tatgctggtt ttacgttagg caaggccgac 
ctacaagaaa tgcaaaaaat ggaagaagac 
gctgaagaaa cagctagagg actttttaaa 
aaccgcagcc atgcctttgc ctattcagct 
cattacccgg ctgtttttta cgatatcatg 
gatgctctag aatcagattt tcaagtagcg 
gataaaattg aagctagcaa gatttacatg 



tactcattta tggatagttt aattgactta 60 
ggttaccaca ccataggaat catggataag 120 
aaaggttgtc aaaaaaatgg actgcagcca 180 
caagagcggc aggtgctcct taacttaatc 240 
ttaaaaattt ccacggcaaa aatgtctggc 300 
ttggaaggga tagcggttat tattcctagt 360 
tttgactact atatgggtgt tgatcagtat 420 
cagcttatac ccctaaggac agttcgttat 480 
atgttgcatg ccattcgaga taacctcagt 540 
caagagttag cagattgtca acaactaacc 600 
ctacagaatt tagaagactt agtgtcagga 660 
ttgcctcatt ttaatagaga taagtctgcc 720 
ggtttgaagg aaaaaggatt gtggaaagag 780 
gtcattattt ctgacatggg ctttgatgat 840 
tttggacgca gtaaaggcta ttatatggga 900 
gtggcttatg ctctgaacat tacagggatt 960 
cgctttttaa acaaagaacg ttatagcatg 1020 
taccgttcag aatttctacg gtatgtccga 1080 
attgtgacct tttcaacctt tggccaggct 1140 
ccagaatacg aactgactaa tctcactaaa 1200 
gtctatgaaa agtcaatctc ttttaggcag 1260 
gcttttgcca ttgccaagcg tatcgaagga 1320 
ggtattgtga tgagtgatga tgccttgacc 1380 
atgatgatca cccagtatga tgctcatgcg 1440 
tttttggggt taagaaattt gacctttgtt 1500 
tacgggtgtc agattgatat tacagccatt 1560 
tttgctaaag gggataccaa gggaattttc 1620 
ttaaaacgga ttaagccaca acgttttgaa 1680 
ccaggggcaa gtgactatac cactaatttc 1740 
gatttgattg atcctgtgat tgctcccatt 1800 
caagaacaag ttatgcagat tgcacaggtt 1860 
ttgttaaggc gtgccatgtc taaaaaaaat 1920 
tttattgctt ctgctaagca cctagggaga 1980 
cggatggaaa aatttgcagg ttatggtttt 2040 
ttagcttttc aattggctta tttcaaagcc 2100 
atgaattatt ctagcagtga ctatatcaca 2160 
caagttacca ttaatagtat tccttacact 2220 
gggctgaaaa atattaaggg gttgccaagg 2280 
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gattttgctt attggattat cgagcaaaga ccatttaata gcgtagagga ttttctcact 2340 
agaactccag aaaaatatca aaaaaaggtt ttccttgagc ctctgataaa aataggtctg 2400 
tttgattgct ttgagcctaa ccgtaaaaaa attctggaca atttggatgg tttactggta 2460 
tttgttaatg agcttggttc tcttttttca gattcttcct ctagttgggt agatacgaaa 2520 
gattactcag taactgaaaa atattctttg gaacaggaga tcgttggagt tggcatgagc 2580 
aagcatcctt taattgatat tgctgagaaa agtacccaaa cttttactcc tatttcacag 2640 
ttagtcaaag aaagcgaagc agtcgtactg attcaaatag atagcattag gatcattaga 2700 
accaaaacaa gtgggcagca aatggctttt ttaagtgtga atgacactaa gaaaaagctc 2760 
gatgtcacac tttctccaca agagtatgcc atttataaag accaattaaa agaaggagaa 2820 
ttctattact taaaaggtag aataaaagaa agagaccatc gactgcagat ggtgtgtcag 2880 
caagtgcaaa tggctattag tcaaaaatat tggttattag ttgaaaacca tcagtttgat 2940 
tcccaaattt ctgagacttt aggtgccttt ccaggaacga ctccagttgt tattcactat 3000 
caaaaaaata aggaaacaat tgcattaact aagattcagg ttcatgtaac agagaattta 3060 
aaggaaaaac ttcgtccttt tgttctgaaa acggtttttc ga 3102 



The encoded a-small subunit has an amino acid sequence corresponding to SEQ. ID. 
No. 20 as follows: 



Met Phe Ala Gin Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Asp Ser 
15 10 15 

Leu lie Asp Leu Asn His Tyr Phe Glu Arg Ala Lys Gin Phe Gly Tyr 
20 25 30 

His Thr lie Gly lie Met Asp Lys Asp Asn Leu Tyr Gly Ala Tyr His 
35 40 45 

Phe lie Lys Gly Cys Gin Lys Asn Gly Leu Gin Pro Val Leu Gly Leu 
50 55 60 

Glu lie Glu lie Leu Tyr Gin Glu Arg Gin Val Leu Leu Asn Leu lie 
65 70 75 80 

Ala Gin Asn Thr Gin Gly Tyr His Gin Leu Leu Lys lie Ser Thr Ala 
85 90 95 

Lys Met Ser Gly Lys Leu His Met Asp Tyr Phe Cys Gin His Leu Glu 
100 105 110 

Gly He Ala Val He He Pro Ser Lys Gly Trp Ser Asp Thr Leu Val 
115 120 125 

Val Pro Phe Asp Tyr Tyr Met Gly Val Asp Gin Tyr Thr Asp Leu Ser 
130 135 140 

His Met Asp Ser Lys Arg Gin Leu He Pro Leu Arg Thr Val Arg Tyr 
145 150 155 160 

Phe Ala Gin Asp Asp Met Glu Thr Leu His Met Leu His Ala He Arg 
165 170 175 

Asp Asn Leu Ser Leu Ala Glu Thr Pro Val Val Glu Ser Asp Gin Glu 
180 185 190 

Leu Ala Asp Cys Gin Gin Leu Thr Ala Phe Tyr Gin Thr His Cys Pro 
195 200 205 

Gin Ala Leu Gin Asn Leu Glu Asp Leu Val Ser Gly He Tyr Tyr Asp 
210 215 22 0 
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Phe Asp Thr Asn Leu Lys Leu Pro His Phe Asn Arg Asp Lys Ser Ala 

225 230 235 240 

Lys Gin Glu Leu Gin Asp Leu Thr Glu Ala Gly Leu Lys Glu Lys Gly 

245 250 255 



lie Ser Asp Met Gly Phe Asp Asp Tyr Phe Leu lie Val Trp Asp Leu 
275 280 285 



Ser Ala Ala Gly Ser Leu Val Ala Tyr Ala Leu Asn lie Thr Gly lie 
305 310 315 320 



Ala Gin lie Val Thr Phe Ser Thr Phe Gly Pro Lys Gin Ala He Arg 
370 375 360 

Asp Val Phe Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Thr Asn Leu 

385 390 395 400 



He Pro Leu Lys Ser Gly Asp Asp Met Met He Thr Gin Tyr Asp Ala 

465 470 475 480 

His Ala Val Glu Ala Asn Gly Leu Leu Lys Met Asp Phe Leu Gly Leu 

485 490 495 



Tyr Gly Cys Gin He Asp He Thr Ala He Asp Leu Glu Asp Pro Gin 

515 520 525 

Thr Leu Ala Leu Phe Ala Lys Gly Asp Thr Lys Gly He Phe Gin Phe 
530 535 540 

Glu Gin Asn Gly Ala He Asn Leu Leu Lys Arg He Lys Pro Gin Arg 

545 550 555 560 
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Phe Glu Glu lie Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser 

565 570 575 

Asp Tyr Thr Thr Asn Phe He Lys Arg Arg Glu Gly Gin Glu Lys He 
580 585 590 

Asp Leu He Asp Pro Val He Ala Pro He Leu Glu Pro Thr Tyr Gly 
595 600 605 

lie Met Leu Tyr Gin Glu Gin Val Met Gin He Ala Gin Val Tyr Ala 
610 615 620 

Gly Phe Thr Leu Gly Lys Ala Asp Leu Leu Arg Arg Ala Met Ser Lys 
625 630 635 640 

Lys Asn Leu Gin Glu Met Gin Lys Met Glu Glu Asp Phe He Ala Ser 
645 650 655 

Ala Lys His Leu Gly Arg Ala Glu Glu Thr Ala Arg Gly Leu Phe Lys 
660 665 670 

Arg Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Phe 
675 680 685 

Ala Tyr Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys Ala His Tyr 
690 695 700 

Pro Ala Val Phe Tyr Asp He Met Met Asn Tyr Ser Ser Ser Asp Tyr 
7 °5 710 715 720 

He Thr Asp Ala Leu Glu Ser Asp Phe Gin Val Ala Gin Val Thr He 
725 730 735 

Asn Ser He Pro Tyr Thr Asp Lys He Glu Ala Ser Lys He Tyr Met 
740 745 750 

Gly Leu Lys Asn He Lys Gly Leu Pro Arg Asp Phe Ala Tyr Trp He 
755 760 765 

He Glu Gin Arg Pro Phe Asn Ser Val Glu Asp Phe Leu Thr Arg Thr 
770 775 780 

Pro Glu Lys Tyr Gin Lys Lys Val Phe Leu Glu Pro Leu He Lys He 
7 85 790 795 800 

Gly Leu Phe Asp cys Phe Glu Pro Asn Arg Lys Lys He Leu Asp Asn 
805 810 815 

Leu Asp Gly Leu Leu Val Phe Val Asn Glu Leu Gly Ser Leu Phe Ser 
B20 825 830 

Asp Ser Ser Phe Ser Trp Val Asp Thr Lys Asp Tyr Ser Val Thr Glu 
835 840 845 



Pro Leu He Asp He Ala Glu Lys Ser Thr Gin Thr Phe Thr Pro He 

865 870 875 880 

Ser Gin Leu Val Lys Glu Ser Glu Ala Val Val Leu He Gin He Asp 

885 890 895 



WO 01/09164 



-56- 



PCT/LSOO/20666 



Ser He Arg He He Arg Thr Lys Thr Ser Gly Gin Gin Met Ala Phe 
900 905 910 

Leu Ser Val Asn Asp Thr Lys Lys Lys Leu Asp Val Thr Leu Phe Pro 
915 920 925 

Gin Glu Tyr Ala He Tyr Lys Asp Gin Leu Lys Glu Gly Glu Phe Tyr 
930 935 940 

Tyr Leu Lys Gly Arg He Lys Glu Arg Asp His Arg Leu Gin Met Val 
945 950 955 960 

Cys Gin Gin Val Gin Met Ala He Ser Gin Lys Tyr Trp Leu Leu Val 
965 970 975 

Glu Asn His Gin Phe Asp Ser Gin He Ser Glu He Leu Gly Ala Phe 
980 985 990 

Pro Gly Thr Thr Pro Val Val He His Tyr Gin Lys Asn Lys Glu Thr 
995 1000 1005 

He Ala Leu Thr Lys He Gin Val Thr Glu Asn Leu Lys Glu Lys Leu 
1010 1015 1020 

Arg Pro Phe Val Leu Lys Thr Val Phe Arg 
1025 1030 



The present invention also relates to the holA gene of Streptococcus 
pyogenes encoding the 5 subunit. The holA gene has a nucleotide sequence which 
corresponds to SEQ. ED. No. 21 as follows: 



atgattgcga tagaaaagat tgaaaaactg 
gtcacaggag atgacattgg tcagtatagc 
gcttttgata aggatgattt ggcctattct 
gatgcagaaa tggatctagt gagcctaccc 
gaccatttgt tagatatcac gaccaataaa 
gcctttgaag cctatttaga aaatccctta 
ggtaaattgg atagtaagag acggcttgtt 
gaagccaacc ctctgaaaga agcagagcta 
ctgggtttag gtttcgagag tggtgccttt 
tttagtcaaa tcatgaaaaa catggccttt 
agcctaactg atattgagca agccattcct 
actagacttg tcctaggagg taaaattgat 
ttatctggag aagatgacat taaattaatc 
ttgcagctga ctattcttgc tagagatgta 
tcagatattc ttgggcggcg ggttaatcct 
aggaccttat ctcttgcctt tctaacagga 
cagataaaaa caggacttta tgagaagagt 
atgactcact ctcaaaaa 



agtaaagaaa atttgggtct tataaccctt 60 
cagttgaaat cccgcttaat ggagcagatt 120 
tactttgata tgtctgaggc cgcttatcag 180 
ttctttgctg agcagaaggt ggttattttt 240 
aaaagtttct taaaagaaaa agacctaaag 300 
gagactactc gactaattat ctttgcccca 360 
aagcttttga aacgtgatgc ccttgtttta 420 
agaacttatt ttcaaaaata cagtcatcaa 480 
gaccaattac ttttgaaatc aaacgatgat 540 
ttaaaagcct ataaaaaaac gggaaatatt 600 
aaaagtttac aagataatat tttcgatctg 660 
gcggctagag atttgattca tgatttacgg 720 
gctatcatgc taggccaatt tcgcttattt 780 
aaaaacgagc aacaactagt gattagttta 840 
taccaggtca agtatgcgtt aaaggattct 900 
gcggtgaaaa ccttgattga gacagattac 960 
tatctagttg atattgctct cttaaaaatc 1020 
1038 



The encoded 6 subunit has an amino acid sequence corresponding to SEQ. ID. No. 22 
as follows: 



Met He Ala He Glu Lys He Glu Lys Leu Ser Lys Glu Asn Leu Gly 
15 10 15 
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Leu He Thr Leu Val Thr Gly Asp Asp He Gly Gin Tyr Ser Gin Leu 

20 25 30 

Lys Ser Arg Leu Met Glu Gin He Ala Phe Asp Lys Asp Asp Leu Ala 
3 5 4 0 4 5 

Tyr Ser Tyr Phe Asp Met Ser Glu Ala Ala Tyr Gin Asp Ala Glu Met 
50 55 60 

Asp Leu Val Ser Leu Pro Phe Phe Ala Glu Gin Lys Val Val He Phe 
65 70 75 80 

Asp His Leu Leu Asp He Thr Thr Asn Lys Lys Ser Phe Leu Lys Glu 
85 90 95 

Lys Asp Leu Lys Ala Phe Glu Ala Tyr Leu Glu Asn Pro Leu Glu Thr 
100 105 110 

Thr Arg Leu He He Phe Ala Pro Gly Lys Leu Asp Ser Lys Arg Arg 
115 120 125 

Leu Val Lys Leu Leu Lys Arg Asp Ala Leu Val Leu Glu Ala Asn Pro 
130 135 140 

Leu Lys Glu Ala Glu Leu Arg Thr Tyr Phe Gin Lys Tyr Ser His Gin 
145 150 155 ' 160 

Leu Gly Leu Gly Phe Glu Ser Gly Ala Phe Asp Gin Leu Leu Leu Lys 
165 170 175 

Ser Asn Asp Asp Phe Ser Gin He Met Lys Asn Met Ala Phe Leu Lys 
180 185 190 

Ala Tyr Lys Lys Thr Gly Asn He Ser Leu Thr Asp He Glu Gin Ala 
195 200 205 

He Pro Lys Ser Leu Gin Asp Asn He Phe Asp Leu Thr Arg Leu Val 
210 215 220 

Leu Gly Gly Lys lie Asp Ala Ala Arg Asp Leu He His Asp Leu Arg 
225 230 235 240 

Leu Ser Gly Glu Asp Asp He Lys Leu He Ala He Met Leu Gly Gin 
245 250 255 

Phe Arg Leu Phe Leu Gin Leu Thr He Leu Ala Arg Asp Val Lys Asn 
260 265 270 

Glu Gin Gin Leu Val He Ser Leu Ser Asp He Leu Gly Arg Arg Val 
275 280 285 

Asn Pro Tyr Gin Val Lys Tyr Ala Leu Lys Asp Ser Arg Thr Leu Ser 
290 295 300 

Leu Ala Phe Leu Thr Gly Ala Val Lys Thr Leu He Glu Thr Asp Tyr 
305 310 315 320 

Gin He Lys Thr Gly Leu Tyr Glu Lys Ser Tyr Leu Val Asp He Ala 
325 330 335 

Leu Leu Lys He Met Thr His Ser Gin Lys 
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The present invention also relates to the holB gene of Streptococcus 
pyogenes encoding the 5' subunit. The holB gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 23 as follows: 



atggatttag 


cgcaaaaagc 


tcctaacgtt 


tatcaagctt 


ttcagacaat 




60 


gaccgtctga 


atcatgctta 


tcttttttcg 


ggtgattttg 


ctaatgaaga 


aatggctctt 


120 


tttttagcta 


aggtcatctt 


ttgtgaacag 


aaaaaggatc 


agacgccctg 


cgggcattgt 


180 


cgctcttgtc 


aattgattga 


acaaggagat 


tttgccgatg 


tgacggtatt 


ggaaccaaca 


240 


gggcaagtga 


ttaaaacgga 


tgtggtcaaa 


gaaatgatgg 


ctaacttttc 


tcagacagga 


300 


tatgaaaaca 


aacgacaagt 


ttttattatc 


aaagattgtg 


acaaaatgca 


tatcaatgcc 


360 


gctaatagct 


tgctaaaata 


cattgaggag 


cctcagggag 


aagcttacat 


atttttattg 


420 


accaatgatg 


ataacaaagt 


gcttccgacc 


attaaaagtc 


ggacacaggt 




480 


cctaaaaacg 




ttaccaattg 




agggattatt 


aaaccatcag 


540 


gctaagctag 


tagccaaact 


tgccacaaac 


accagtcatc 


tagaacgtct 


gttgcaaacg 


600 


agcaagcttt 


tagaactgat 




gagcgttttg 


tatctatttg 


gctgaaagat 


660 


cagttgcagg 


catatttagc 


gttgaaccgt 


ctggtacagt 


tagcaactga 


aaaagaagaa 


720 


caagatttag 


ttttgaccct 


tttgaccttg 


ctcttggcaa 


gagagcgtgc 


gcaaacgcct 


780 


ttgacacaat 


tggaggctgt 


ctatcaggct 


aggctcatgt 


ggcagagcaa 


tgttaatttt 


840 


caaaacacat 


tagaatatat 


ggtgatgtca 


gaa 






873 



The encoded 8' subunit has an amino acid sequence corresponding to SEQ. ID. No. 24 
as follows: 

Met Asp Leu Ala Gin Lys Ala Pro Asn Val Tyr Gin Ala Phe Gin Thr 
1 5 10 15 

He Leu Lys Lys Asp Arg Leu Asn His Ala Tyr Leu Phe Ser Gly Asp 
20 25 30 

Phe Ala Asn Glu Glu Met Ala Leu Phe Leu Ala Lys Val He Phe Cys 
35 40 45 

Glu Gin Lys Lys Asp Gin Thr Pro Cys Gly His Cys Arg Ser Cys Gin 
50 55 60 

Leu He Glu Gin Gly Asp Phe Ala Asp Val Thr Val Leu Glu Pro Thr 
65 70 75 80 

Gly Gin Val He Lys Thr Asp Val Val Lys Glu Met Met Ala Asn Phe 
85 90 95 

Ser Gin Thr Gly Tyr Glu Asn Lys Arg Gin Val Phe He He Lys Asp 
100 105 no 

Cys Asp Lys Met His He Asn Ala Ala Asn Ser Leu Leu Lys Tyr He 
115 120 125 

Glu Glu Pro Gin Gly Glu Ala Tyr He Phe Leu Leu Thr Asn Asp Asp 
130 135 140 

Asn Lys Val Leu Pro Thr He Lys Ser Arg Thr Gin Val Phe Gin Phe 
145 150 155 160 

Pro Lys Asn Glu Ala Tyr Leu Tyr Gin Leu Ala Gin Glu Lys Gly Leu 
165 170 175 

Leu Asn His Gin Ala Lys Leu Val Ala Lys Leu Ala Thr Asn Thr Ser 
180 185 190 
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His Leu Glu Arg Leu Leu Gin Thr Ser Lys Leu Leu Glu Leu lie Thr 
195 200 205 

Gin Ala Glu Arg Phe Val Ser He Trp Leu Lys Asp Gin Leu Gin Ala 
210 215 220 

Tyr Leu Ala Leu Asn Arg Leu Val Gin Leu Ala Thr Glu Lys Glu Glu 
225 230 235 240 

Gin Asp Leu Val Leu Thr Leu Leu Thr Leu Leu Leu Ala Arg Glu Arg 
245 250 255 

Ala Gin Thr Pro Leu Thr Gin Leu Glu Ala Val Tyr Gin Ala Arg Leu 
260 265 270 

Met Trp Gin Ser Asn Val Asn Phe Gin Asn Thr Leu Glu Tyr Met Val 
275 280 285 

Met Ser Glu 
290 



The present invention also relates to the dnaX gene of Streptococcus 
pyogenes encoding the x subunit. The dnaX gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 25 as follows: 



atgtatcaag ctctttatcg gaaataccgg 
tcggttattt ccacaacttt aaagcaggca 
cttttttcag gtcctagagg gactgggaaa 
atgaattgtc ctaaccaagt cgatggtgaa 
atcacgaatg gaagcttgga agatgtgatt 
gatgaaattc gtgacattcg agacaaatca 
gtttatatta ttgatgaggt tcacatgtta 
actttggaag aaccgacaga atgttgtctt 
ccagccacta ttttatctcg tgtgcaacgc 
attcgagagc atttagcctg ggttttggac 
ttaaatctca ttgcaaggcg agcagaagga 
caggctttga gcttgtcacc agataatcag 
ggttctattt ccatacttgc tctgggtgac 
acgcaagctc tggcagcctt agagaccatt 
gcgacagatt tattgaccta tctgcgtgat 
caacgtcagt cagctgtttt tgataccaat 
atgataacag ttgttactag tcatctccct 
tatgccgaaa tgatgactat ccaattagct 
ttgtcaggag agttaatctc agagattgaa 
caacaattgt cgcagctcca atcgcgtcct 
cctaaaacca caagctacag ggttgatcgg 
gttcgaaata gccaacaatc tcgacaatat 
attctagata acatttctgc ccaagacaga 
gcaaatagtg agaatgcgat tttggctttc 
agccgaaata atcttaatga tatgtttggt 
cccaatattc tggcagtacc aaggacagat 
caaatgaaat cgcaaaaaga cagtgttcaa 
gaagggtttg attttttgct cgataaaata 



agccaaacgt ttgacgaaat ggtgggacaa 60 
gttgaatctg gcaagattag ccatgcttat 120 
acaagtgcgg caaagatttt tgcaaaggcc 180 
ccctgtaatc aatgcgatat ttgccgagat 240 
gaaattgatg ctgcctcgaa taatggtgtt 300 
acctatgcgc caagtcgtgc gacttacaag 360 
tcaacagggg cttttaatgc gcttttgaaa 420 
tatcttggca acaacggaat gcataaaatt 480 
tttgaattca aagctattaa gcaaaaagct 540 
aaagaaggta ttgcctatga ggtggatgct 600 
ggcatgcgtg atgctttatc tattttagat 660 
gtcgccattg caattgccga agaaattaca 720 
tatgttcgat atgtctccca agaacaggct 780 
tatgatagtg ggaagagcat gagccgcttt 840 
ttattggtgg ttaaagctgg cggcgacaat 900 
ttgtctctct cgatagatcg tatattccaa 960 
gaaatcaaaa agggaaccca tcctcggatt 1020 
cagaaagagc agattttgtc ccaagtaaac 1080 
acgctcaaaa atgagttggc acaacttaaa 1140 
gattcactgg caagatctga taaaacgaaa 1200 
gttaccattt tgaaaatcat ggaagaaacg 1260 
ctagatgctc taaaaaatgc ttggaatgaa 1320 
gccttattga tgggctctga gcctgtctta 1380 
gaggctgcct ttaatgcaga acaagtcatg 1440 
aacattatga gtaaagctgc tggtttttct 1500 
tttcagcata ttcgtaagga atttgctcag 1560 
gaagaacaag aagtagcgct tgatattcca 1620 
aatactattg acgac 1665 



The encoded x subunit has an amino acid sequence corresponding to SEQ. ID. No. 26 
as follows: 
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Met Tyr Gin Ala Leu Tyr Arg Lys Tyr Arg Ser Gin Thr Phe Asp Glu 
1 5 10 15 

Met Val Gly Gin Ser Val He Ser Thr Thr Leu Lys Gin Ala Val Glu 
20 25 30 

Ser Gly Lys He Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

Gly Lys Thr Ser Ala Ala Lys He Phe Ala Lys Ala Met Asn Cys Pro 
50 55 60 

Asn Gin Val Asp Gly Glu Pro Cys Asn Gin Cys Asp He Cys Arg Asp 
55 70 75 80 

lie Thr Asn Gly Ser Leu Glu Asp Val He Glu lie Asp Ala Ala Ser 



Asn Asn Gly Val Asp Glu He Arg Asp He Arg Asp Lys Ser Thr Tyr 
100 105 110 

Ala Pro Ser Arg Ala Thr Tyr Lys Val Tyr He He Asp Glu Val His 
115 120 125 

Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 
130 135 140 

Pro Thr Glu Asn Val Phe He Leu Ala Thr Thr Glu Leu His Lys He 
145 150 155 160 

Pro Ala Thr He Leu Ser Arg Val Gin Arg Phe Glu Phe Lys Ala He 
165 170 175 

Lys Gin Lys Ala He Arg Glu His Leu Ala Trp Val Leu Asp Lys Glu 
180 185 190 

Gly He Ala Tyr Glu Val Asp Ala Leu Asn Leu He Ala Arg Arg Ala 
195 200 205 

Glu Gly Gly Met Arg Asp Ala Leu Ser He Leu Asp Gin Ala Leu Ser 
210 215 220 

Leu Ser Pro Asp Asn Gin Val Ala He Ala He Ala Glu Glu He Thr 
225 230 235 240 

Gly Ser lie Ser lie Leu Ala Leu Gly Asp Tyr Val Arg Tyr Val Ser 
245 250 255 

Gin Glu Gin Ala Thr Gin Ala Leu Ala Ala Leu Glu Thr lie Tyr Asp 
260 265 270 

Ser Gly Lys Ser Met Ser Arg Phe Ala Thr Asp Leu Leu Thr Tyr Leu 
275 280 285 

Arg Asp Leu Leu Val Val Lys Ala Gly Gly Asp Asn Gin Arg Gin Ser 
290 295 300 

Ala Val Phe Asp Thr Asn Leu Ser Leu Ser lie Asp Arg He Phe Gin 
305 310 315 320 

Met He Thr Val Val Thr Ser His Leu Pro Glu He Lys Lys Gly Thr 
325 330 335 
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His Pro Arg He Tyr Ala Glu Met Met Thr He Gin Leu Ala Gin Lys 
340 345 350 

Glu Gin He Leu Ser Gin Val Asn Leu Ser Gly Glu Leu He Ser Glu 
355 360 365 

lie Glu Thr Leu Lys Asn Glu Leu Ala Gin Leu Lys Gin Gin Leu Ser 
370 375 380 

Gin Leu Gin Ser Arg Pro Asp Ser Leu Ala Arg Ser Asp Lys Thr Lys 
385 390 395 ' 400 

Pro Lys Thr Thr Ser Tyr Arg Val Asp Arg Val Thr He Leu Lys He 
405 410 415 

Met Glu Glu Thr Val Arg Asn Ser Gin Gin Ser Arg Gin Tyr Leu Asp 
420 425 430 

Ala Leu Lys Asn Ala Trp Asn Glu lie Leu Asp Asn He Ser Ala Gin 
435 440 445 

Asp Arg Ala Leu Leu Met Gly Ser Glu Pro Val Leu Ala Asn Ser Glu 
450 455 460 

Asn Ala He Leu Ala Phe Glu Ala Ala Phe Asn Ala Glu Gin Val Met 
465 470 475 480 

Ser Arg Asn Asn Leu Asn Asp Met Phe Gly Asn He Met Ser Lys Ala 
485 490 495 

Ala Gly Phe Ser Pro Asn lie Leu Ala Val Pro Arg Thr Asp Phe Gin 
500 505 510 

His lie Arg Lys Glu Phe Ala Gin Gin Met Lys Ser Gin Lys Asp Ser 
515 520 525 

Val Gin Glu Glu Gin Glu Val Ala Leu Asp He Pro Glu Gly Phe Asp 
530 535 540 

Phe Leu Leu Asp Lys He Asn Thr He Asp Asp 
545 550 555 



The present invention also relates to the dnaN gene of Streptococcus 
pyogenes encoding the p subunit. The dnaN gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 27 as follows: 



atgattcaat tttcaattaa tcgcacatta 
gctattagca ctaaaaatgc cattcctatt 
acaggagtaa ctttaacagg gtctaacggt 
agtaatgaaa atgctggttt gctaattacc 
ttttttatta atattatttc aagtttgcca 
caacaccaag ttgttttaac cagtggtaaa 
gaccagtatc ctcgtctaca agaagtatca 
ttattgaagt ctattattgc tgaaacagct 
attttaacag gagttcatat tgtattaagt 
gactctcatc gtatgagcca acgtttaatc 
gtagttcttc caagtaaatc tttgagagaa 
accgttgagg tatttttctc accaagccaa 



tttattcatg ctttaaatac aactaaacgt 60 
ctttcatcaa taaaaattga agtcacttct 120 
caaatatcaa ttgaaaacac tattcctgta 180 
tctccaggag ctattttatt agaagctagt 240 
gatattagta taaatgttaa agaaattgaa 300 
tcagagatta ccttaaaagg aaaagatgtt 360 
acagaaaatc ctttgatttt aaaaacaaaa 420 
tttgcagcca gtttacaaga aagtcgtcct 480 
aatcataaag attttaaagc agtagcgact 54 0 
actttggaca atacttcagc agatttgatg 600 
ttttcagcag tatttacaga tgatattgag 660 
atcttgttca gaagtgaaca catttctttt 720 
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tatacacgcc tcttagaagg aaattatccc gatacagacc gtttattaat gacagaattt 780 

gagacggagg ttgttttcaa tacccaatcc cttcgccacg ctatggaacg tgccttcttg 840 

atttctaatg ctactcaaaa tggtactgtt aagcttgaga ttacccaaaa tcatatttca 900 

gctcatgtta acccacctga ggttggtaag gtaaacgagg atttagatat tgttagtcag 960 

tctggtagtg atttaactat cagcctcaac ccaacttacc ttattgagtc tctaaaagct 1020 

attaaaagtg aaacagtaaa aattcatttc ttatcaccag ttcgaccatt caccctaaca 1080 

ccaggcgatg aggaagaaag ttttatccaa ttaattacac cagtacgaac aaac 1134 

The encoded p subunil has an amino acid sequence corresponding to SEQ. ED. No. 28 
as follows: 



Met He Gin Phe Ser He Asn Arg Thr Leu Phe He His Ala Leu Asn 
15 10 15 

Thr Thr Lys Arg Ala He Ser Thr Lys Asn Ala He Pro He Leu Ser 
20 25 30 

Ser He Lys He Glu Val Thr Ser Thr Gly Val Thr Leu Thr Gly Ser 
35 40 45 

Asn Gly Gin He Ser He Glu Asn Thr He Pro Val Ser Asn Glu Asn 
50 55 60 

Ala Gly Leu Leu He Thr Ser Pro Gly Ala He Leu Leu Glu Ala Ser 
65 70 75 80 

Phe Phe He Asn He He Ser Ser Leu Pro Asp He Ser He Asn Val 
85 90 95 

Lys Glu He Glu Gin His Gin Val Val Leu Thr Ser Gly Lys Ser Glu 
100 105 110 



Ala Val Ala Thr Asp Ser His Arg Met Ser Gin Arg Leu He Thr Leu 
180 185 190 



Tyr Thr Arg Leu Leu Glu Gly Asn Tyr Pro Asp Thr Asp Arg Leu Leu 

245 250 255 

Met Thr Glu Phe Glu Thr Glu Val Val Phe Asn Thr Gin Ser Leu Arg 
260 265 270 
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His Ala Met Glu Arg Ala Phe Leu He Ser Asn Ala Thr Gin Asn Gly 
275 280 285 

Thr Val Lys Leu Glu He Thr Gin Asn His He Ser Ala His Val Asn 
290 295 300 



Ser Pro Glu Val Gly Lys Val Asn Glu Asp Leu Asp He Val Ser Gin 
305 310 315 320 

Ser Gly Ser Asp Leu Thr He Ser Phe Asn Pro Thr Tyr Leu He Glu 
325 330 335 

Ser Leu Lys Ala lie Lys Ser Glu Thr Val Lys He His Phe Leu Ser 
340 345 350 

Pro Val Arg Pro Phe Thr Leu Thr Pro Gly Asp Glu Glu Glu Ser Phe 
355 360 365 

He Gin Leu He Thr Pro Val Arg Thr Asn 
370 375 



The present invention also relates to the ssb gene of Streptococcus 
pyogenes encoding the single strand-binding protein (SSB). The ssb gene has a 
nucleotide sequence which corresponds to SEQ. ID. No. 29 as follows: 



atgattaata atgtagtact agttggtcgc atgaccaagg atgcagaact tcgttacaca 60 

ccaagtcaag tagctgtggc taccttcaca cttgctgtta accgtacctt taaaagccaa 120 

aatggtgaac gcgaggcaga tttcattaac tgtgtgatct ggcgtcaacc ggctgaaaat 180 

ttagcgaact gggctaaaaa aggtgctttg accggagtta cgggtcgtat tcatacacgt 240 

aactacgaaa accaacaagg acaacgtgtc tatgtaacag aagttgttgc agataatttc 300 

caaatgttgg aaagtcgtgc tacacgtgaa ggtggctcaa ctggctcatt taatggtggt 360 

tttaacaata acacttcatc atcaaacagt tactcagcgc ctgcacaaca aacgcctaac 420 

tttggaagag atgatagccc atttgggaac tcaaacccga tggatatctc agatgacgat 480 
cttccattct ag 492 



The encoded SSB protein has an amino acid sequence corresponding to SEQ. ID. 
No. 30 as follows: 



Met He Asn Asn Val Val Leu Val Gly Arg Met Thr Lys Asp Ala Glu 

1 5 10 ' 15 

Leu Arg Tyr Thr Pro Ser Gin Val Ala Val Ala Thr Phe Thr Leu Ala 

20 25 30 

Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 

35 40 45 

lie Asn Cys Val He Trp Arg Gin Pro Ala Glu Asn Leu Ala Asn Trp 

5 0 5 5 60 

Ala Lys Lys Gly Ala Leu lie Gly Val Thr Gly Arg He Gin Thr Arg 

65 70 75 80 

Asn Tyr Glu Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 
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Ala Asp Asn Phe Gin Met Leu Glu 
100 

Ser Thr Gly Ser Phe Asn Glv Gly 

115 ' 120 

Asn Ser Tyr Ser Ala Pro Ala Gin 

130 135 

Asp Ser Pro Phe Gly Asn Ser Asn 

145 150 

Leu Pro Phe 



Ser Arg Ala Thr Arg Glu Gly Gly 
105 110 

Phe Asn Asn Asn Thr Ser Ser Ser 
125 

Gin Thr Pro Asn Phe Gly Arg Asp 
140 

Pro Met Asp He Ser Asp Asp Asp 
155 160 



The present invention also relates to the dnaG gene of Streptococcus 
pyogenes encoding the primase. The dnaG gene has a nucleotide sequence which 
corresponds to SEQ. ED. No. 31 as follows: 



atgggatttt 


tatggggagg 


tgacgatttg gcaattgaca 


aagaaatgat 


ttcccaagta 


60 


aaaaatagcg 


ttaatattgt 


cgatgtcatt 


ggagaagtgg 


tcaaactttc 


ccgatcaggg 


120 


cggcattacc 


tcgggctttg 


cccatttcat 


aaggaaaaga 


caccctcttt 


taatgttgtt 


180 


gaagacagac 


aattttttca 


ctgctttggc 


tgtggaaaat 


caggggatgt 


ttttaaattt 


240 


attgaggaat 


accgccaagt 




gaaagtgttc 


agattattgc 


cgataagact 


300 


ggtatgtcgc 




gccaagtcag 


gcagtacttg 






360 


aatcacgctt 


tgatgacact 


tcatgaggat 




tttaccatgc 


agttttgatg 


420 




ttggtcaaga 


agctaggaag 




agagaggctt 


ggatgaccaa 


480 


ttaattgagc 




tggtttagcc 


ccagatgagt 


cagattatct 


ttatcaagct 


540 


ctttctaaaa 


aatacgagga 


aggtcaattg 


gttgcttcag 




cttgtccgat 


600 


caatccaata 


ccatttacga 


cgcctttcga 


aatcgtatca 


tgtttccctt 


atcagatgac 


660 


cgagggcata 




ttcaggacgt 


atctggacgg 


cagctgatat 


ggaaaagaga 


720 


caggcaaagt 




aagaggaaca 


gttcttttta 




tgaattgtat 


780 


catctggaca 


aggcaaggcc 


tgttattgcc 


aaaacccatg 


aagtgtttct 


aatggaaggg 


840 


tttatggacg 


tgattgccgc 




ggctatgaaa 


atgctgttgc 


ttcaatgggg 


900 


acggcattga 


ctcaagaaca 


tgtcaatcac 


cttaagcaag 


tcactaaaaa 


agttgttttg 


960 


atttatgatg 


gtgacgatgc 


tggacaacat 


gctattgcaa 


aatcactaga 


attgcttaaa 


1020 


gattttgttg 


tcgaaattgt 


cagaatcccc 


aataaaatgg 


atcctgacga 


atttgtacaa 


1080 


cggcattccc 


cagaagcatt 


tgcagatttg 


cttaagcagt 


cacggatcag 


tagtgttgaa 


1140 


ttttttattg 




acctactaat 


gtagacaatt 


tgcaatcaca 


aattgtttat 


1200 


gtggagaaaa 


tggcaccatt 


gattgctcaa 


tcacoatcca 


tcacagctca 


acattcgtat 


1260 


attaacaaga 


ttgctgattt 


gttgccaaac 


tttgactatt 


ttcaagtaga 


acaatcagta 


1320 


aatgcattaa 


ggattcaaga 


taggcaaaaa 


catcaaggtc 


aaatagctca 


agccgtcagc 


1380 


aatcttgtga 




gccaaaaagt 


ttgacagcta 


ttgctaagac 


agaaagtcat 


1440 


ctcatgcatc ggctcttaca 


tcatgactat 


ttattaaatg 




tcgtgatgat 


1500 


ttttattttg 


atacctctac 


cttagaatta 


ctttatcaac 


ggctgaagca 


acaaggacac 


1560 


attacatctt 


atgatttgtc 


agagatgtca 


gaggaagtta 


accgtgctta 




1620 


ttagaagaaa 


accttcccaa 


agaagtagct 


cttggtgaga 


ttgatgatat 


tttatccaaa 


1680 


cgtgccaaac 


ttttagcaga 


gcgcgatctt 


cacaaacaag 


ggaaaaaagt 


tagagaatct 


1740 


agtaacaaag 


gcgatcatca 


agcggctcta 


gaagtactag 




tgcgcagaaa 


1800 


cgaaaaatgg 










1815 



The encoded primase has an amino acid sequence corresponding to SEQ. ID. No. 32 
as follows: 
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Met Gly Phe Leu Trp Gly Gly Asp Asp Leu Ala He Asp Lys Glu Met 

15 10 15 

He Ser Gin Val Lys Asn Ser Val Asn He Val Asp Val He Glv Glu 
20 25 30 

Val Val Lys Leu Ser Arg Ser Gly Arg His Tyr Leu Gly Leu Cys Pro 
35 40 45 

Phe His Lys Glu Lys Thr Pro Ser Phe Asn Val Val Glu Asp Arg Gin 
5 0 5 5 60 

Phe Phe His Cys Phe Gly Cys Gly Lys Ser Gly Asp Val Phe Lys Phe 
65 70 75 80 

He Glu Glu Tyr Arg Gin Val Pro Phe Leu Glu Ser Val Gin He He 
85 90 95 

Ala Asp Lys Thr Gly Met Ser Leu Asn He Pro Pro Ser Gin Ala Val 
100 105 HO 

Leu Ala Ser Gin His Lys His Pro Asn His Ala Leu Met Thr Leu His 
115 120 125 

Glu Asp Ala Ala Lys Phe Tyr His Ala Val Leu Met Thr Thr Thr He 
130 135 140 

Gly Gin Glu Ala Arg Lys Tyr Leu Tyr Gin Arg Gly Leu Asp Asp Gin 
145 150 155 ' 160 

Leu He Glu His Phe Asn He Gly Leu Ala Pro Asp Glu Ser Asp Tyr 
165 170 175 

Leu Tyr Gin Ala Leu Ser Lys Lys Tyr Glu Glu Gly Gin Leu Val Ala 
180 185 190 

Ser Gly Leu Phe His Leu Ser Asp Gin Ser Asn Thr He Tyr Asp Ala 
195 200 205 

Phe Arg Asn Arg He Met Phe Pro Leu Ser Asp Asp Arg Gly His He 
210 215 220 

He Ala Phe Ser Gly Arg He Trp Thr Ala Ala Asp Met Glu Lys Arg 
225 230 235 ~ 240 

Gin Ala Lys Tyr Lys Asn Ser Arg Gly Thr Val Leu Phe Asn Lys Ser 
245 250 255 

Tyr Glu Leu Tyr His Leu Asp Lys Ala Arg Pro Val He Ala Lys Thr 
260 265 270 

His Glu Val Phe Leu Met Glu Gly Phe Met Asp Val He Ala Ala Tyr 
275 280 ~ 285 

Arg Ser Gly Tyr Glu Asn Ala Val Ala Ser Met Gly Thr Ala Leu Thr 
290 295 300 

Gin Glu His Val Asn His Leu Lys Gin Val Thr Lys Lys Val Val Leu 
305 310 315 320 

He Tyr Asp Gly Asp Asp Ala Gly Gin His Ala He Ala Lys Ser Leu 
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Glu Leu Leu Lys Asp Phe Val Val Glu He Val Arg He Pro Asn Lys 
340 345 350 

Met Asp Pro Asp Glu Phe Val Gin Arg His Ser Pro Glu Ala Phe Ala 
355 360 365 



Tyr Leu Lys Pro Thr Asn Val Asp Asn Leu Gin Ser Gin He Val Tyr 
385 390 395 400 

Val Glu Lys Met Ala Pro Leu He Ala Gin Ser Pro Ser He Thr Ala 
405 410 415 

Gin His Ser Tyr He Asn Lys He Ala Asp Leu Leu Pro Asn Phe Asp 
420 425 430 

Tyr Phe Gin Val Glu Gin Ser Val Asn Ala Leu Arg He Gin Asp Arg 
435 440 445 



His Arg Asp Asp Phe Tyr Phe Asp Thr Ser Thr Leu Glu Leu Leu Tyr 
500 505 510 

Gin Arg Leu Lys Gin Gin Gly His He Thr Ser Tyr Asp Leu Ser Glu 
515 520 525 

Met Ser Glu Glu Val Asn Arg Ala Tyr Tyr Asn Val Leu Glu Glu Asn 
530 535 540 

Leu Pro Lys Glu Val Ala Leu Gly Glu He Asp Asp He Leu Ser Lys 
545 550 555 560 

Arg Ala Lys Leu Leu Ala Glu Arg Asp Leu His Lys Gin Gly Lys Lys 
565 570 575 

Val Arg Glu Ser Ser Asn Lys Gly Asp His Gin Ala Ala Leu Glu Val 
580 585 590 

Leu Glu His Phe He Ala Gin Lys 
595 600 

The present invention also relates to the dnaB gene of Streptococcus 
pyogenes encoding DnaB. The dnaB gene has a nucleotide sequence which 
corresponds to SEQ. ED. No. 33 as follows: 



atgaggttgc ctgaagtagc tgaattacga 
tctgttcttg ggtcaatctt tatctcacct 
agtccagacg atttttataa gtacgctcat 
agcgatcgta atgatgccat tgatgcaacc 



gttcaacccc aagatttact agcagagcaa 60 
gataagctga ttgcagtgag agaatttatc 120 
aaaattatct ttcgggcaat gattaccctc 180 
actataagaa caatcctaga tgatcaagat 240 
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gatctgcaaa gtattggtgg cttatcctat 
agtgctaatg cagaatatta tgctaaaatt 
actgctaggt tgacagaatc tgtcaaccta 
gttatcgctg gagttgagag agctttaatt 
tttcgcaaaa tttcagatgt gctaaaagtt 
cagacttcaa atgttacagg tttaccaact 
ggtttacacc cagatcaatt agttatttta 
tttgttctta atattgcgca aaatgtgggg 
tctttggaaa tgggtgctga aagtttagta 
gattcgcaca gtttaagaac agggcaactc 
gctcagggag ctttggcaga agcaccgatt 
actgaaatcc gcgcaagatc acggaaattg 
attgtaattg actacttaca gttgattaca 
gtttcagata tttcaagaca gcttaaaatc 
gccctaagtc agctttctcg tggcgttgag 
gatattcgtg aatcaggatc tattgagcag 
gacgattatt accgtaaaga atgtgatgat 
gaagttatcc tcgagaaaaa tagagctggg 
aaagaataca acaaattctc aagtatagcc 



attgttgaac tagttaatag tgtcccaact 300 
gtagctgaga aagctatgtt gcgtgatatt 350 
gcttatgatg aaattttaaa accagaagag 420 
gaactcaatg aacatagtaa tcgtagtggg 480 
aattacgagg ctttagaagc acgttctaag 540 
ggttttagag accttgacaa gattacaaca 600 
gctgctcggc cagcagtggg gaagactgcc 660 
actaagcaaa aaaagactgt tgctattttt 720 
gatcgtatgc ttgcagcaga aggaatggtt 7B0 
acagatcagg attggaataa tgtaacaatt 840 
tatattgacg atacgcccgg gattaaaatt 900 
tctcaagaag tggatggtgg tttaggtctc 960 
ggaactaaac ccgaaaatcg tcagcaagag 1020 
ctagctaaag aattgaaagt accagttatt 1080 
caaaggcaag ataaacgacc agttttatca 1140 
gatgccgata ttgtagcctt cttataccgg 1200 
gctgaagaag ctgttgaaga taacacaatt 1260 
gcgcgtggaa cagtcaaact gatgttccaa 1320 
cagtttgaag aaagataa 1368 



The encoded DnaB has an amino acid sequence corresponding to SEQ. ID. No. 34 as 
follows: 



Met Arg Leu Pro Glu Val Ala Glu Leu Arg Val Gin Pro Gin Asp Leu 
IS 10 15 

Leu Ala Glu Gin Ser Val Leu Gly Ser lie Phe lie Ser Pro Asp Lys 
20 25 30 

Leu lie Ala Val Arg Glu Phe lie Ser Pro Asp Asp Phe Tyr Lys Tyr 
35 40 45 

Ala His Lys lie lie Phe Arg Ala Met lie Thr Leu Ser Asp Arg Asn 
50 55 60 

Asp Ala lie Asp Ala Thr Thr lie Arg Thr lie Leu Asp Asp Gin Asp 
65 70 75 ' 80 

Asp Leu Gin Ser He Gly Gly Leu Ser Tyr He Val Glu Leu Val Asn 
85 90 95 

Ser Val Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys He Val Ala 
100 105 110 

Glu Lys Ala Met Leu Arg Asp He He Ala Arg Leu Thr Glu Ser Val 
115 120 125 

Asn Leu Ala Tyr Asp Glu He Leu Lys Pro Glu Glu Val He Ala Gly 
130 135 140 

Val Glu Arg Ala Gin Gly Ala Leu Ala Glu Ala Pro He Tyr He Asp 
145 150 155 " 160 

Asp Thr Pro Gly He Lys He Ala Leu He Glu Leu Asn Glu His Ser 
165 170 175 

Asn Arg Ser Gly Phe Arg Lys He Ser Asp Val Leu Lys Val Asn Tyr 
180 185 190 

Glu Ala Leu Glu Ala Arg Ser Lys Gin Thr Ser Asn Val Thr Gly Leu 
195 200 205 
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Pro Thr Gly Phe Arg Asp Leu Asp Lys lie Thr Thr Gly Leu Kis Pro 
210 215 220 

Asp Gin Leu Val He Leu Ala Ala Arg Pro Ala Val Gly Lys Thr Ala 
225 230 235 240 

Phe Val Leu Asn He Ala Gin Asn Val Gly Thr Lys Gin Lys Lys Thr 
245 250 255 

Val Ala He Phe Ser Leu Glu Met Gly Ala Glu Ser Leu Val Asp Arg 
260 265 270 

Met Leu Ala Ala Glu Gly Met Val Asp Ser His Ser Leu Arg Thr Gly 
275 280 285 

Gin Leu Thr Asp Gin Asp Trp Asn Asn Val Thr He Thr Glu He Arg 
290 295 300 

Ala Arg Ser Arg Lys Leu Ser Gin Glu Val Asp Gly Gly Leu Gly Leu 
305 310 315 320 

He Val He Asp Tyr Leu Gin Leu He Thr Gly Thr Lys Pro Glu Asn 
325 330 ' 335 

Arg Gin Gin Glu Val Ser Asp He Ser Arg Gin Leu Lys He Leu Ala 
340 345 350 

Lys Glu Leu Lys Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
355 360 365 

Val Glu Gin Arg Gin Asp Lys Arg Pro Val Leu Ser Asp He Arg Glu 
370 375 380 

Ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Arg Lys Glu Cys Asp Asp Ala Glu Glu Ala Val Glu 
405 410 415 

Asp Asn Thr He Glu Val He Leu Glu Lys Asn Arg Ala Gly Ala Arg 
420 425 430 

Gly Thr Val Lys Leu Met Phe Gin Lys Glu Tyr Asn Lys Phe Ser Ser 
435 440 445 

He Ala Gin Phe Glu Glu Arg 
450 455 

Fragments of the above polypeptides or proteins are also encompassed 
by the present invention. 

Suitable fragments can be produced by several means. In the first, 
subclones of the gene encoding the protein of the present invention are produced by 
conventional molecular genetic manipulation by subcloning gene fragments. The 
subclones then are expressed in vitro or in vivo in bacterial cells to yield a smaller 
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protein or peptide that can be tested for activity according to the procedures described 
below. 

As an alternative, fragments of replication proteins can be produced by 
digestion of a full-length replication protein with proteolytic enzymes like 
chymotrypsin or Staphylococcus proteinase A, or trypsin. Different proteolytic 
enzymes are likely to cleave replication proteins at different sites based on the amino 
acid sequence of the protein. Some of the fragments that result from proteolysis may 
be active and can be tested for activity as described below. 

In another approach, based on knowledge of the primary structure of 
the protein, fragments of a replication protein gene may be synthesized by using the 
PCR technique together with specific sets of primers chosen to represent particular 
portions of the protein. These then would be cloned into an appropriate vector for 
increased expression of a truncated peptide or protein. 

Chemical synthesis can also be used to make suitable fragments. Such 
a synthesis is carried out using known amino acid sequences of replication proteins 
being produced. Alternatively, subjecting a full length replication protein to high 
temperatures and pressures will produce fragments. These fragments can then be 
separated by conventional procedures (e.g., chromatography, SDS-PAGE). 

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of amino acids that have minimal influence on the properties, 
secondary structure, and hydropathic nature of the polypeptide. For example, a 
polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal end 
of the protein which cotranslationally or post-translationally directs transfer of the 
protein. The polypeptide may also be conjugated to a linker or other sequence for 
ease of synthesis, purification, or identification of the polypeptide. 

Suitable DNA molecules are those that hybridize to a DNA molecule 
comprising a nucleotide sequence of at least about 20, more preferably at least about 
30 to about 50, continuous bases of either SEQ. ID. Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, or 33 under stringent conditions such as those characterized 
by a hybridization buffer comprising 0.9M sodium citrate ("SSC") buffer at a 
temperature of about 37°C and remaining bound when subject to washing the SSC 
buffer at a temperature of about 37°C; and preferably in a hybridization buffer 
comprising 20% formamide in 0.9M SSC buffer at a temperature of about 42°C and 
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remaining bound when subject to washing at about 42°C with 0.2x SSC buffer. 
Stringency conditions can be further varied by modifying the temperature and/or salt 
content of the buffer, or by modifying the length of the hybridization probe. 

The proteins or polypeptides of the present invention are preferably 
produced in purified form (preferably at least 80%, more preferably 90%, pure) by 
conventional techniques. Typically, the proteins or polypeptides of the present 
invention is secreted into the growth medium of recombinant host cells. 
Alternatively, the proteins or polypeptides of the present invention are produced but 
not secreted into growth medium. In such cases, to isolate the protein, the host cell 
(e.g., E. coli) carrying a recombinant plasmid is propagated, lysed by sonication, heat, 
or chemical treatment, and the homogenate is centrifuged to remove bacterial debris. 
The supernatant is then subjected to purification procedures such as ammonium 
sulfate precipitation, gel filtration, ion exchange chromatography, FPLC, and HPLC. 

The DNA molecule encoding replication polypeptides or proteins 
derived from Gram positive bacteria can be incorporated in cells using conventional 
recombinant DNA technology. Generally, this involved inserting the DNA molecule 
into an expression system to which the DNA molecule is heterologous (i.e. not 
normally present). The heterologous DNA molecule is inserted into the expression 
system or vector in proper sense orientation and correct reading frame. The vector 
contains the necessary elements for the transcription and translation of the inserted 
protein-coding sequences. 

U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby 
incorporated by reference, describes the production of expression systems in the form 
of recombinant plasmids using restriction enzyme cleavage and ligation with DNA 
ligase. These recombinant plasmids are then introduced by means of transformation 
and replicated in unicellular cultures including procaryotic organisms and eucaryotic 
cells grown in tissue culture. 

Recombinant genes may also be introduced into viruses, such as 
vaccina virus. Recombinant viruses can be generated by transfection of plasmids into 
cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gtl 1, gt WES.tB, Charon 4, and plasmid vectors 
such as pBR322, pBR325, pACYC177, pACYC1084, pUC8, pUC9, pUC18, pUC19, 
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pLG339 ; pR290, pKC37, pKClOl, SV 40, pBluescnpt II SK +/- or KS +/- (see 
"Stratagene Cloning Systems" Catalog (1993) from Stratagene, La Jolla, Calif, which 
is hereby incorporated by reference), pQE, pIH821 , pGEX, pET series (see 
F.W. Studier et al., "Use of T7 RNA Polymerase to Direct Expression of Cloned 
Genes," Gene Expression Technology vol. 185 (1990), which is hereby incorporated 
by reference), and any derivatives thereof. Recombinant molecules can be introduced 
into cells via transformation, particularly transduction, conjugation, mobilization, or 
electroporation. The DNA sequences are cloned into the vector using standard 
cloning procedures in the art, as described by Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Springs Laboratory, Cold Springs Harbor, New York 
(1989), which is hereby incorporated by reference. 

A variety of host-vector systems may be utilized to express the protein- 
encoding sequence(s). Primarily, the vector system must be compatible with the host 
cell used. Host-vector systems include but are not limited to the following: bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; 
microorganisms such as yeast containing yeast vectors; mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected 
with virus (e.g., baculovirus); and plant cells infected by bacteria. The expression 
elements of these vectors vary in their strength and specificities. Depending upon the 
host-vector system utilized, anyone of a number of suitable transcription and 
translation elements can be used. 

Different genetic signals and processing events control many levels of 
gene expression (e.g., DNA transcription and messenger RNA (mRNA) translation). 

Transcription of DNA is dependent upon the presence of a promotor 
which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eucaryotic promoters differ from 
those of procaryotic promoters. Furthermore, eucaryotic promoters and accompanying 
genetic signals may not be recognized in or may not function in a procaryotic system, 
and, further procaryotic promoters are not recognized and do not function in 
eucaryotic cells. 

Similarly, translation of mRNA in procaryotes depends upon the 
presence of the proper procaryotic signals which differ from those of eukaryotes. 
Efficient translation of mRNA in procaryotes requires a ribosome binding site called 
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the Shine-Dalgarno ("SD") sequence on the mRNA. This sequence is a short 
nucleotide sequence of mRNA that is located before the same codon, usually AUG, 
which encodes the amino-terminal methionine of the protein. The SD sequences are 
complementary to the 3 '-end of the 16S rRNA (ribosomal RNA) and probably 
promote binding of mRNA to ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing gene expression, see 
Roberts and Lauer, Methods in Enzymology , 68:473 (1979), which is hereby 
incorporated by reference. 

Promoters vary in their "strength" (i.e. their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
strong promoters in order to obtain a high level of transcription and, hence, expression 
of the gene. Depending upon the host cell system utilized, any one of a number of 
suitable promoters may be used. For instance, when cloning in E. coli, its 
bacteriophages, or plasmids, promoters such as the T7 phage promoter, lac promotor, 
trp promotor, recA promotor, ribosomal RNA promotor, the P R and P L promoters of 
coliphage lambda and others, including but not limited, to lac\JV5, ompF, bla, Ipp, 
and the like, may be used to direct high levels of transcription of adjacent DNA 
segments. Additionally, a hybrid trp-lac\JV5 (tac) promotor or other E. coli 
promoters produced by recombinant DNA or other synthetic DNA techniques may be 
used to provide for transcription of the inserted gene. 

Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promotor unless specifically induced. In certain operations, 
the addition of specific inducers is necessary for efficient transcription of the inserted 
DNA. For example, the lac operon is induced by the addition of lactose or IPTG 
(isopropylthio-beta-D-galactoside). A variety of other operons, such as trp, pro, etc., 
are under different controls. Additionally, the cell may carry the gene for a 
heterologous RNA polymerase such as from phage T7. Thus, a promoter specific for 
T7 RNA polymerase is used. The T7 RNA polymerase may be under inducible 
control. 

Specific initiation signals are also required for efficient gene 
transcription and translation in procaryotic cells. These transcription and translation 
initiation signals may vary in "strength" as measured by the quantity of gene specific 
messenger RNA and protein synthesized, respectively. The DNA expression vector, 
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which contains a promotor, may also contain any combination of various "strong" 
transcription and/or translation initiation signals. For instance, efficient translation in 
E. coli requires an SD sequence about 7-9 bases 5' to the initiation codon ("ATG") to 
provide a ribosome binding site. Thus, an SD-ATG combination that can be utilized 
by host cell ribosomes may be employed. Such combinations include but are not 
limited to the SD-ATG combination from the cro gene or the N gene of coliphage 
lambda, or from the E. coli tryptophan E, D, C, B or A genes. Additionally, any SD- 
ATG combination produced by recombinant DNA or other techniques involving 
incorporation of synthetic nucleotides may be used. 

Once the isolated DNA molecule encoding a replication polypeptide or 
protein has been cloned into an expression system, it is ready to be incorporated into a 
host cell. Such incorporation can be carried out by the various forms of 
transformation noted above, depending upon the vector/host cell system. Suitable host 
cells include, but are not limited to, bacteria, viruses, yeast, mammalian cells, insects, 
plants, and the like. 

The invention provides efficient methods of identifying 
pharmacological agents or lead compounds for agents active at the level of a 
replication protein function, particularly DNA replication. Generally, these screening 
methods involve assaying for compounds which interfere with the replication activity. 
The methods are amenable to automated, cost-effective high throughput screening of 
chemical libraries for lead compounds. Identified reagents find use in the 
pharmaceutical industries for animal and human trials; for example, the reagents may 
be derivatized and rescreened in in vitro and in vivo assays to optimize activity and 
minimize toxicity for pharmaceutical development. Target therapeutic indications are 
limited only in that the target cellular function be subject to modulation, usually 
inhibition, by disruption of a replication activity or the formation of a complex 
comprising a replication protein and one or more natural intracellular binding targets. 
Target indications may include arresting cell growth or causing cell death resulting in 
recovery from the bacterial infection in animal studies. 

A wide variety of assays for activity and binding agents are provided, 
including DNA synthesis, ATPase, clamp loading onto DNA, protein-protein binding 
assays, immunoassays, cell based assays, etc. The replication protein compositions, 
used to identify pharmacological agents, are in isolated, partially pure or pure form 
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and are typically recombinantly produced. The replication protein may be part of a 
fusion product with another peptide or polypeptide (e.g., a polypeptide that is capable 
of providing or enhancing protein-protein binding, stability under assay conditions 
(e.g., a tag for detection or anchoring), etc.). The assay mixtures comprise a natural 
5 intracellular replication protein binding target such as DNA, another protein, NTP, or 

dNTP. For binding assays, while native binding targets may be used, it is frequently 
preferred to use portions (e.g., peptides, nucleic acid fragments) thereof so long as the 
portion provides binding affinity and avidity to the subject replication protein 
conveniently measurable in the assay. The assay mixture also comprises a candidate 

10 pharmacological agent. Generally, a plurality of assay mixtures are run in parallel 

with different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative control 
(i.e., at zero concentration or below the limits of assay detection). Additional controls 
are often present such as a positive control, a dose response curve, use of known 

15 inhibitors, use of control heterologous proteins, etc. Candidate agents encompass 

numerous chemical classes, though typically they are organic compounds; preferably 
they are small organic compounds and are obtained from a wide variety of sources, 
including libraries of synthetic or natural compounds. A variety of other reagents may 
also be included in the mixture. These include reagents like salts, buffers, neutral 

20 proteins (e.g., albumin, detergents, etc.), which may be used to facilitate optimal 

binding and/or reduce nonspecific or background interactions, etc. Also reagents that 
otherwise improve the efficiency of the assay (e.g., protease inhibitors, nuclease 
inhibitors, antimicrobial agents, etc.) may be used. 

The invention provides replication protein specific assays and the 

25 binding agents including natural intracellular binding targets such as other replication 

proteins, etc., and methods of identifying and making such agents, and their use in a 
variety of diagnostic and therapeutic applications, especially where disease is 
associated with excessive cell growth. Novel replication protein-specific binding 
agents include replication protein-specific antibodies and other natural intracellular 

30 binding agents identified with assays such as one- and two-hybrid screens, non-natural 

intracellular binding agents identified in screens of chemical libraries, etc. 

Generally, replication protein-specificity of the binding agent is shown 
by binding equilibrium constants. Such agents are capable of selectively binding a 
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replication protein (i.e., with an equilibrium constant at least about 10' M" 1 , 
preferably, at least about 10 8 M" 1 , more preferably, at least about 10 g M"'). A wide 
variety of cell-based and cell-free assays may be used to demonstrate replication 
protein-specific activity, binding, gel shift assays, immunoassays, etc. 
5 The resultant mixture is incubated under conditions whereby, but for 

the presence of the candidate pharmacological agent, the replication protein 
specifically binds the cellular binding target, portion, or analog. The mixture of 
components can be added in any order that provides for the requisite bindings. 
Incubations may be performed at any temperature which facilitates optimal binding, 

10 typically between 4°C and 40°C, more commonly between 15°C and 40°C. Incubation 

periods are likewise selected for optimal binding but also minimized to facilitate 
rapid, high-throughput screening, and are typically between 0.1 and 10 hours, 
preferably less than 5 hours, more preferably less than 2 hours. 

After incubation, the presence or absence of activity or specific binding 

15 between the replication protein and one or more binding targets is detected by any 

convenient way. For cell-free activity and binding type assays, a separation step may 
be used to separate the activity product or the bound from unbound components. 
Separation maybe effected by precipitation (e.g., immunoprecipitation), 
immobilization (e.g., on a solid substrate such as a microti ter plate), etc., followed by 

20 washing. Many assays that do not require separation are also possible such as use of 

europium conjugation in proximity assays or a detection system that is dependent on a 
product or loss of substrate. 

Detection may be effected in any convenient way. For cell-free activity 
and binding assays, one of the components usually comprises or is coupled to a label. 

25 A wide variety of labels may be employed - essentially any label that provides for 

detection of DNA product, loss of DNA substrate, conversion of a nucleotide 
substrate, or bound protein is useful. The label may provide for direct detection such 
as radioactivity, fluorescence, luminescence, optical, or electron density, etc. or 
indirect detection such as an epitope tag, an enzyme, etc. The label maybe appended 

30 to the protein (e.g., a phosphate group comprising a radioactive isotope of 

phosphorous), or incorporated into the DNA substrate or the protein structure (e.g., a 
methionine residue comprising a radioactive isotope of sulfur.) A variety of methods 
may be used to detect the label depending on the nature of the label and other assay 
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components. For example, the label may be detected bound to the solid substrate, or a 
portion of the bound complex containing the label may be separated from the solid 
substrate, and thereafter the label detected. Labels may be directly detected through 
optical or electron density, radioactive emissions, nonradiative energy transfer, 
fluorescence emission, etc. or indirectly detected with antibody conjugates, etc. For 
example, in the case of radioactive labels, emissions may be detected directly (e.g., 
with particle counters) or indirectly (e.g., with scintillation cocktails and counters). 

The present invention identifies the set of proteins that together result 
in a three component polymerase from bacteria that are distantly related to E. coli, 
such as Gram positive bacteria. Specifically, these bacteria lack several genes that E. 
coli DNA polymerase HI has, such as holD, holD or holE. Further, dnaX is believed 
to encode only one protein, tau. Also, holA is quite divergent in homology suggesting 
it may function in another process in these organisms. Gram positive cells even have 
replication genes that E. coli does not, implying that they may not utilize the 
replication strategies exemplified by E. coli. 

The present invention identifies genes and proteins that form a three 
component polymerase in Gram positive organisms, such as S. pyogenes and S. 
aureus. In S. pyogenes and S. aureus, the polymerase a-large, functions with a 
(3 clamp and a clamp loader component of x85'. They display high speed and 
processivity in synthesis of ssDNA coated with SSB and primed with a DNA 
oligonucleotide. 

This invention also expresses and purifies a protein from a Gram 
positive bacteria that is homologous to the E. coli beta subunit. The invention 
demonstrates that it behaves like a circular protein. Further, this invention shows that 
a beta subunit from a Gram positive bacteria is functional with both Pol ITI-L (a-large) 
from a Gram positive bacteria and with DNA polymerase HI from a Gram negative 
bacteria. This result can be explained by an interaction between the clamp and the 
polymerase that has been conserved during the evolutionary divergence of Gram 
positive and Gram negative cells. A chemical inhibitor that would disrupt this 
interaction would be predicted to have a broad spectrum of antibiotic activity, shutting 
down replication in gram negative and gram positive cells alike. This assay, and 
others based on this interaction, can be devised to screen chemicals for such 
inhibition. Further, since all the proteins in this assay are highly overexpressed 
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through recombinant techniques, sufficient quantities of the protein reagents can be 
obtained for screening hundreds of thousands of compounds. 

This invention also shows that the DnaE polymerase (a-small), 
encoded by the dnaE gene, functions with the beta clamp and x55' complex. The 
5 speed of DnaE is not significantly increased by t55' and (3, but the processivity of 

DnaE is greatly increased by t55' and p. Hence, the DnaE polymerase, coupled with 
its P clamp on DNA (loaded by t56') may also be an important target for a candidate 
pharmaceutical drug. 

The present invention provides methods by which replication proteins 

10 from a Gram positive bacteria are used to discover new pharmaceutical agents. The 

function of replication proteins is quantified in the presence of different chemical 
compounds. A chemical compound that inhibits the function is a candidate antibiotic. 
Some replication proteins from a Gram positive bacteria and from a Gram negative 
bacteria can be interchanged for one another. Hence, they can function as mixtures. 

15 Reactions that assay for the function of enzyme mixtures consisting of proteins from 

Gram positive bacteria and from Gram negative bacteria can also be used to discover 
drugs. Suitable E. coli replication proteins are the subunits of its Pol ITJ holoenzyme 
which are described in U.S. Patent Nos. 5,583,026 and 5,668,004 to O'Donnell, which 
are hereby incorporated by reference. 

20 The methods described herein to obtain genes, and the assays 

demonstrating activity behavior of S. pyogenes and S. aureus replication proteins are 
likely to generalize to all members of the Streptococcus and Staphylococcus genuses, 
as well as to all Gram positive bacteria. Such assays are also likely to generalize to 
other cells besides Gram positive bacteria which also share features in common with 

25 S. pyogenes and S. aureus that are different from E. coli (i.e., lacking holC, holD, or 

holE; having a dnaX gene encoding a single protein; or having a weak homology to 
holA encoding delta). 

The present invention describes a method of identifying compounds 
which inhibit the activity of a polymerase product of polC or dnaE. This method is 

30 carried out by forming a reaction mixture that includes a primed DNA molecule, a 

polymerase product of polC or dnaE, a candidate compound, a dNTP, and optionally 
either a beta subunit, a tau complex, or both the beta subunit and the tau complex, 
wherein at least one of the polymerase product of polC or dnaE, the beta subunit, the 
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tau complex, or a subunit or combination of subunits thereof is derived from a 
Eubacteria other than Escherichia coli; subjecting the reaction mixture to conditions 
effective to achieve nucleic acid polymerization in the absence of the candidate 
compound; analyzing the reaction mixture for the presence or absence of nucleic acid 
polymerization extension products; and identifying the candidate compound in the 
reaction mixture where there is an absence of nucleic acid polymerization extension 
products. Preferably, the polymerase product of polC or dnaE, the beta subunit, the 
tau complex, or the subunit or combination of subunits thereof is derived from a Gram 
positive bacterium, more preferably a Streptococcus bacterium such as S. pyogenes or 
a Staphylococcus bacterium such as S. aureus. 

The present invention describes a method to identify chemicals that 
inhibit the activity of the three component polymerase. This method involves 
contacting primed DNA with the DNA polymerase in the presence of the candidate 
pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction mixture. The 
reaction mixture is subjected to conditions effective to achieve nucleic acid 
polymerization in the absence of the candidate pharmaceutical and the presence or 
absence of the extension product in the reaction mixture is analyzed. The candidate 
pharmaceutical is detected by the absence of product. 

The present invention describes a method to identify candidate 
pharmaceuticals that inhibit the activity of a clamp loader complex and a beta subunit 
in stimulating the DNA polymerase. The method includes contacting a primed DNA 
(which may be coated with SSB) with a DNA polymerase, a beta subunit, and a tau 
complex (or subunit or subassembly of the tau complex) in the presence of the 
candidate pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction 
mixture. The reaction mixture is subjected to conditions which, in the absence of the 
candidate pharmaceutical, would effect nucleic acid polymerization and the presence 
or absence of the extension product in the reaction mixture is analyzed. The candidate 
pharmaceutical is detected by the absence of product. The DNA polymerase, the beta 
subunit, and/or the tau complex or subunit(s) thereof are derived from a Gram positive 
bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a beta subunit and a DNA polymerase to interact physically. This 
method involves contacting the beta subunit with the DNA polymerase in the presence 
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of the candidate pharmaceutical to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the DNA polymerase and the beta subunit would 
interact in the absence of the candidate pharmaceutical. The reaction mixture is then 
analyzed for interaction between the beta unit and the DNA polymerase. The 
candidate pharmaceutical is detected by the absence of interaction between the beta 
subunit and the DNA polymerase. The DNA polymerase and/or the beta subunit are 
derived from a Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a beta subunit and a tau complex (or a subunit or subassembly of 
the tau complex) to interact. This method includes contacting the beta subunit with 
the tau complex (or subunit or subassembly of the tau complex) in the presence of the 
candidate pharmaceutical to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the tau complex (or the subunit or subassembly 
of the tau complex) and the beta subunit would interact in the absence of the candidate 
pharmaceutical. The reaction mixture is then analyzed for interaction between the 
beta subunit and the tau complex (or the subunit or subassembly of the tau complex). 
The candidate pharmaceutical is detected by the absence of interaction between the 
beta subunit and the tau complex (or the subunit or subassembly of the tau complex) . 
The beta subunit and/or the tau complex or subunit thereof is derived from a Gram 
positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a tau complex (or a subassembly of the tau complex) to assemble 
a beta subunit onto a DNA molecule. This method involves contacting a circular 
primed DNA molecule (which may be coated with SSB) with the tau complex (or the 
subassembly thereof) and the beta subunit in the presence of the candidate 
pharmaceutical, and ATP or dATP to form a reaction mixture. The reaction mixture 
is subjected to conditions under which the tau complex (or subassembly) assembles 
the beta subunit on the DNA molecule absent the candidate pharmaceutical. The 
presence or absence of the beta subunit on the DNA molecule in the reaction mixture 
is analyzed. The candidate pharmaceutical is detected by the absence of the beta 
subunit on the DNA molecule. The beta subunit and/or the tau complex are derived 
from a Gram positive bacterium. 
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The present invention describes a method to identify chemicals that 
inhibit the ability of a tau complex (or a subunit(s) of the tau complex) to disassemble 
a beta subunit from a DNA molecule. This method comprises contacting a DNA 
molecule onto which the beta subunit has been assembled in the presence of the 
candidate pharmaceutical, to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the tau complex (or a subunit(s) or subassembly 
of the tau complex) disassembles the beta subunit from the DNA molecule absent the 
candidate pharmaceutical. The presence or absence of the beta subunit on the DNA 
molecule in the reaction mixture is analyzed. The candidate pharmaceutical is 
detected by the presence of the beta subunit on the DNA molecule. The beta subunit 
and/or the tau complex are derived from a Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
disassemble a beta subunit from a DNA molecule. This method involves contacting a 
circular primed DNA molecule (which may be coated with SSB) upon which the beta 
subunit has been assembled (e.g. by action of the tau complex) with the candidate 
pharmaceutical. The presence or absence of the beta subunit on the DNA molecule in 
the reaction mixture is analyzed. The candidate pharmaceutical is detected by the 
absence of the beta subunit on the DNA molecule. The beta subunit is derived from a 
Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the dATP/ATP binding activity of a tau complex or a tau complex subunit (e.g. 
tau subunit). This method includes contacting the tau complex (or the tau complex 
subunit) with dATP/ATP either in the presence or absence of a DNA molecule and/or 
the beta subunit in the presence of the candidate pharmaceutical to form a reaction. 
The reaction mixture is subjected to conditions in which the tau complex (or the 
subunit of tau complex) interacts with dATP/ATP in the absence of the candidate 
pharmaceutical. The reaction is analyzed to determine if dATP/ATP is bound to the 
tau complex (or the subunit of tau complex) in the presence of the candidate 
pharmaceutical. The candidate pharmaceutical is detected by the absence of 
hydrolysis. The tau complex and/or the beta subunit is derived from a Gram positive 
bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the dATP/ATPase activity of a tau complex or a tau complex subunit (e.g., the 
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tau subunit). This method involves contacting the tau complex (or the tau complex 
subunit) with d ATP/ ATP either in the presence or absence of a DNA molecule and/or 
a beta subunit in the presence of the candidate pharmaceutical to form a reaction 
mixture. The reaction mixture is subjected to conditions in which the tau subunit (or 
complex) hydrolyzes dATP/ATP in the absence of the candidate pharmaceutical. The 
reaction is analyzed to determine if dATP/ATP was hydrolyzed. Suitable candidate 
pharmaceuticals are identified by the absence of hydrolysis. The tau complex and/or 
the beta subunit is derived from a Gram positive bacterium. 

Further methods for identifying chemicals that inhibit the activity of a 
DNA polymerase encoded by either the dnaE gene, polC gene, or their accessory 
proteins (i.e., clamp loader, clamp, etc.), are as follows: 

1) Contacting a primed DNA molecule with the encoded product 
of the dnaE gene or polC gene in the presence of the candidate pharmaceutical, and 
dNTPs (or modified dNTPs) to form a reaction mixture. The reaction mixture is 
subjected to conditions, which in the absence of the candidate pharmaceutical, affect 
nucleic acid polymerization and the presence or absence of the extension product in 
the reaction mixture is analyzed. The candidate pharmaceutical is detected by the 
absence of extension product. The protein encoded by the dnaE gene and PolC gene 
is derived from a Gram positive bacterium. 

2) Contacting a linear primed DNA molecule with a beta subunit 
and the encoded product of dnaE or PolC in the presence of the candidate 
pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction mixture. The 
reaction mixture is subjected to conditions, which in the absence of the candidate 
pharmaceutical, affect nucleic acid polymerization, and the presence or absence of the 
extension product in the reaction mixture is analyzed. The candidate pharmaceutical 
is detected by the absence of extension product. The protein encoded by the dnaE 
gene and PolC gene is derived from a Gram positive bacterium. 

3) Contacting a circular primed DNA molecule (may be coated 
with SSB) with a tau complex, a beta subunit and the encoded product of a dnaE gene 
or PolC gene in the presence of the candidate pharmaceutical, and dNTPs (or 
modified dNTPs) to form a reaction mixture. The reaction mixture is subjected to 
conditions, which in the absence of the candidate pharmaceutical, affect nucleic acid 
polymerization, and the presence or absence of the extension product in the reaction 
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mixture is analyzed. The candidate pharmaceutical is detected by the absence of 
product. The protein encoded by the dnaE gene and PolC gene, the beta subunit, 
and/or the tau complex are derived from a Gram positive bacterium. 

4) Contacting a beta subunit with the product encoded by a dnaE 
5 gene or PolC gene in the presence of the candidate pharmaceutical to form a reaction 

mixture. The reaction mixture is then analyzed for interaction between the beta 
subunit and the product encoded by the dnaE gene or PolC gene. The candidate 
pharmaceutical is detected by the absence of interaction between the beta subunit and 
the product encoded by the dnaE gene or PolC gene. The beta subunit and/or the 
1 0 protein encoded by the dnaE gene and PolC gene is derived from a Gram positive 

bacterium. 

5) The present invention discloses a method to identify chemicals 
that inhibit a DnaB helicase. The method includes contacting the DnaB helicase with 
a DNA molecule substrate that has a duplex region in the presence of a nucleoside or 

15 deoxynucleoside triphosphate energy source and a candidate pharmaceutical to form a 

reaction mixture. The reaction mixture is subjected to conditions that support helicase 
activity in the absence of the candidate pharmaceutical. The DNA duplex molecule in 
the reaction mixture is analyzed for whether it is converted to ssDNA. The candidate 
pharmaceutical is detected by the absence of conversion of the duplex DNA molecule 

20 to the ssDNA molecule. The DnaB helicase is derived from a Gram positive 

bacterium. 

6) The present invention describes a method to identify chemicals 
that inhibit the nucleoside or deoxynucleoside triphosphatase activity of a DnaB 
helicase. The method includes contacting the DnaB helicase with a DNA molecule 

25 substrate that has a duplex region in the presence of a nucleoside or deoxynucleoside 

triphosphate energy source and the candidate pharmaceutical to form a reaction 
mixture. The reaction mixture is subjected to conditions that support nucleoside or 
deoxynucleoside triphosphatase activity of the DnaB helicase in the absence of the 
candidate pharmaceutical. The candidate pharmaceutical is detected by the absence of 

30 conversion of nucleoside or deoxynucleoside triphosphate to nucleoside or 

deoxynucleoside diphosphate. The DnaB helicase is derived from a Gram positive 
bacterium. 
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7) The present invention describes a method to identify chemicals 
that inhibit a primase. The method includes contacting primase with a ssDNA 
molecule in the presence of a candidate pharmaceutical to form a reaction mixture. 
The reaction mixture is subjected to conditions that support primase activity (e.g., the 
presence of nucleoside or deoxynucleoside triphosphates, appropriate buffer, presence 
or absence of DnaB helicase) in the absence of the candidate pharmaceutical. Suitable 
candidate pharmaceuticals are identified by the absence of primer formation detected 
either directly or indirectly. The primase is derived from a Gram positive bacterium. 

8) The present invention describes a method to identify chemicals 
that inhibit the ability of a primase and the protein encoded by a dnaB gene to interact. 
This method includes contacting the primase with the protein encoded by the dnaB 
gene in the presence of the candidate pharmaceutical to form a reaction mixture. The 
reaction mixture is subjected to conditions under which the primase and the protein 
encoded by the dnaB gene interact in the absence of the candidate pharmaceutical. 
The reaction mixture is then analyzed for interaction between the primase and the 
protein encoded by the dnaB gene. The candidate pharmaceutical is detected by the 
absence of interaction between the primase and the protein encoded by the dnaB gene. 
The primase and/or the dnaB gene are derived from a Gram positive bacterium. 

9) The present invention describes a method to identify chemicals 
that inhibit the ability of a protein encoded by a dnaB gene to interact with a DNA 
molecule. This method includes contacting the protein encoded by the dnaB gene 
with the DNA molecule in the presence of the candidate pharmaceutical to form a 
reaction mixture. The reaction mixture is subjected to conditions under which the 
DNA molecule and the protein encoded by the dnaB gene interact in the absence of 
the candidate pharmaceutical. The reaction mixture is then analyzed for interaction 
between the protein encoded by the dnaB gene and the DNA molecule. The candidate 
pharmaceutical is detected by the absence of interaction between the DNA molecule 
and the protein encoded by the dnaB gene. The dnaB gene is derived from a Gram 
positive bacterium. 
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EXAMPLES 

The following examples are provided to illustrate embodiments of the 
present invention, but they are by no means intended to limit its scope. 

Example 1 - Materials 

Labeled deoxy- and ribonucleoside triphosphates were from Dupont- 
New England Nuclear; unlabelled deoxy- and ribonucleoside triphosphates were from 
Pharmacia- LKB; E. coli replication proteins were purified as described, alpha, 
epsilon, gamma, and tau (Studwell et al., "Processive Replication is Contingent on the 
Exonuclease Subunit of DNA Polymerase m Holoenzyme," J. Biol. Chem. , 265:1171- 
1 178 (1990), which is hereby incorporated by reference), beta (Kong et al., "Three 
Dimensional Structure of the Beta Subunit of Escherichia coli DNA Polymerase ITJ 
Holoenzyme: A Sliding DNA Clamp," Cell, 69:425-437 (1992), which is hereby 
incorporated by reference), delta and delta prime (Dong et al., "DNA Polymerase III 
Accessory Proteins. I. HolA and holB Encoding 8 and 5'," J. Biol. Chem. , 268:1 1 758- 
1 1765 (1993), which is hereby incorporated by reference), chi and psi (Xiao et al., 
"DNA Polymerase EQ Accessory Proteins. III. HolC and holD Encoding chi and psi," 
J. Biol. Chem. , 268:1 1773-1 1778 (1993), which is hereby incorporated by reference), 
theta (Studwell- Vaughan et al., "DNA Polymerase m Accessory Proteins. V. Theta 
Encoded by holE," J. Biol. Chem. , 268:11785-11791 (1993), which is hereby 
incorporated by reference), and SSB (Weiner et al., "The Deoxyribonucleic Acid 
Unwinding Protein of Escherichia coli," J. Biol. Chem. , 250:1972-1980 (1975), 
which is hereby incorporated by reference). E. coli Pol EQ core and clamp loader 
complex (composed of subunits gamma, delta, delta prime, chi, and psi) were 
reconstituted as described in Onrust et al., "Assembly of a Chromosomal Replication 
Machine: Two DNA Polymerases, a Clamp Loader and Sliding Clamps in One 
Holoenzyme Particle. I. Organization of the Clamp Loader," J. Biol. Chem. , 
270:13348-13357 (1995), which is hereby incorporated by reference. Pol ffl* was 
reconstituted and purified as described in Onrust et al., "Assembly of a Chromosomal 
Replication Machine: Two DNA Polymerases, a Clamp Loader and Sliding Clamps 
in One Holoenzyme Particle. III. Interface Between Two Polymerases and the Clamp 
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Loader" J. Biol. Chem. , 270:13366-13377 (1995), which is hereby incorporated by 
reference. Protein concentrations were quantitated by the Protein Assay (Bio-Rad) 
method using bovine serum albumin (BSA) as a standard. DNA oligonucleotides 
were synthesized by Oligos etc. Calf thymus DNA was from Sigma. Buffer A is 20 
5 mM Tris-HCl (pH=7.5), 0.5 mM EDTA, 2 mM DTT, and 20% glycerol. Replication 

buffer is 20 mM Tris-Cl (pH 7.5), 8 mM MgCl 2 , 5 mM DTT, 0.5 mM EDTA, 40 
ng/ml BSA, 4% glycerol, 0.5 mM ATP, 3 mM each dCTP, dGTP, dATP, and 20 pM 
[a- 32 P]dTTP. P-cell buffer is 50 mM potassium phosphate (pH 7.6), 5 mM DTT, 0.3 
mM EDTA, 20% glycerol. T.E. buffer is 10 mM Tris-HCl (pH 7.5), 1 mM EDTA. 
10 Cell lysis buffer is 50 mM Tris-HCl (pH 8.0) 10 % sucrose, 1 M NaCl, 0.3 mM 

spermidine. 

Example 2 - Calf Thymus DNA Replication Assays 

15 These assays were used in the purification of DNA polymerases from 

S. aureus cell extracts. Assays contained 2.5 pg activated calf thymus DNA in a final 
volume of 25 nl replication buffer. An aliquot of the fraction to be assayed was added 
to the assay mixture on ice followed by incubation at 37°C for 5 min. DNA synthesis 
was quantitated using DE81 paper as described in Rowen et al., "Primase, the DnaG 

20 Protein of Escherichia coli. An Enzyme Which Starts DNA Chains," J. Biol. Chem. , 

253:758-764 (1979), which is hereby incorporated by reference. 

Example 3 - PolydA-oligodT Replication Assays 

25 PolydA-oligodT was prepared as follows. PolydA of average length 

4500 nucleotides was purchased from SuperTecs. OHgodT35 was synthesized by 
Oligos etc. 145 ul of 5.2 mM (as nucleotide) polydA and 22 p.1 of 1.75 mM (as 
nucleotide) oligodT were mixed in a final volume of 2100 pi T.E. buffer (ratio as 
nucleotide was 21:1 polydA to oligodT). The mixture was heated to boiling in a 1 ml 

30 eppendorf tube, then removed and allowed to cool to room temperature. Assays were 

performed in a final volume of 25 \i\ 20 mM Tris-Cl (pH 7.5), 8 mM MgCl2, 5 mM 
DTT, 0.5 mM EDTA, 40 jig/ml BSA, 4% glycerol, containing 20 ^M [a- 32 P]dTTP 
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and 0.36 ng polydA-oligodT. Proteins were added to the reaction on ice, then shifted 
to 37°C for 5 min. DNA synthesis was quantitated using DE81 paper as described in 
Rowen et al., "Primase, the DnaG Protein of Escherichia coli. An Enzyme Which 
Starts DNA Chains," J. Biol. Chem. , 253:758-764 (1979), which is hereby 
5 incorporated by reference. 

Example 4 - Singly Primed M13mpl8 ssDNA Replication Assays 

M13mpl8 was phenol extracted from phage and purified by two 
10 successive bandings (one downward and one upward) in cesium chloride gradients. 

M 1 3mpl 8 ssDNA was singly primed with a DNA 30m er (map position 68 1 7-6846) as 
described in Studwell et al. "Processive Replication is Contingent on the Exonuclease 
Subunit of DNA Polymerase PI Holoenzvme," J. Biol. Chem. . 265:1171-1178 (1990), 
which is hereby incorporated by reference. Replication assays contained 72 ng of 
15 singly primed M 1 3mp 1 8 ssDNA in a final volume of 25 nl of replication buffer. 

Other proteins added to the assay, and their amounts, are indicated in the Brief 
Description of the Drawings. Reactions were incubated for 5 min. at 37°C and then 
were quenched upon adding an equal volume of 1 % SDS and 40 mM EDTA. DNA 
synthesis was quantitated using DE81 paper as described in Rowen et al., "Primase, 
20 the DnaG Protein of Escherichia coli. An Enzyme Which Starts DNA Chains," J. 

Biol. Chem. , 253:758-764 (1979), which is hereby incorporated by reference, and 
product analysis was performed in a 0.8% native agarose gel followed by 
autoradiography. 

25 Example 5 - Genomic Staphylococcus aureus DNA 

Two strains of S. aureus were used. For PCR of the first fragment of 
the dnaXgene sequence, the strain was ATCC 25923. For all other work the strain 
was strain 4220 (a gift of Dr. Pat Schlievert, University of Minnisota). This strain 
30 lacks a gene needed for producing toxic shock (Kreiswirth et al., "The Toxic Shock 

Syndrome Exotoxin Structural Gene is Not Detectably Transmitted by a Prophage," 
Nature . 305:709-712 (1996) and Balan et al., "Autocrine Regulation of Toxin 
Synthesis by Staphylococcus aureus," Proc. Natl. Acad. Sci. USA . 92:1619-1623 
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(1995), which are hereby incorporated by reference). S. aureus cells were grown 
overnight at 37°C in LB containing 0.5% glucose. Cells were collected by 
centrifugation (24 g wet weight). Cells were resuspended in 80 ml solution I (50 mM 
glucose, 1 0 mM EDTA, 25 mM Tris-HCL (pH 8.0)). SDS and NaOH were then 
added to 1% and 0.2 N, respectively, followed by incubation at 65°C for 30 min. to 
lyse the cells. 68.5 ml of 3 M sodium acetate (pH 5.0) was added followed by 
centrifugation at 12,000 rpm for 30 min. The supernatant was discarded and the pellet 
was washed twice with 50 ml of 6M urea, 10 mM Tris-HCL (pH 7.5), 1 mM EDTA 
using a dounce homogenizer. After each wash, the resuspended pellet was collected 
by centrifugation (12,000 rpm for 20 min.). After the second wash, the pellet was 
resuspended in 50 ml 10 mM T.E. buffer using a dounce homogenizer and then 
incubated for 30 min. at 65°C. The solution was centrifuged at 12,000 rpm for 20 
min., and the viscous supernatant was collected. 43.46 g CsCb was added to the 50 
ml of supernatant (density between 1.395-1.398) and poured into two 35 ml quick seal 
ultracentrifuge tubes (tubes were completely filled using the same density of CsCl2 in 
T.E.). To each tube was added 0.5 ml of a 10 mg/ml stock of ethidium bromide. 
Tubes were spun at 55,000 rpm for 1 8 h at 18°C in a Sorvall TV860 rotor. The band 
of genomic DNA was extracted using a syringe and needle. Ethidium bromide was 
removed using two butanol extractions and then dialyzed against 4 1 of T.E. at pH 8.0 
overnight. The DNA was recovered by ethanol precipitation and then resuspended in 
T.E. buffer (1 .7 mg total) and stored at -20°C. 

Example 6 - Cloning and Purification of S. aureus Pol III-L 

To further characterize the mechanism of DNA replication in S. 
aureus, large amounts of its replication proteins were produced through use of the 
genes. The polC gene encoding S. aureus Pol ELL (alpha-large) subunit has been 
sequenced and expressed in E. coli (Pacitti et al., "Characterization and 
Overexpression of the Gene Encoding Staphylococcus aureus DNA Polymerase in," 
Gene , 165:51-56 (1995), which is hereby incorporated by reference). The previous 
work utilized a pBS[KS] vector for expression in which the E. coli RNA polymerase 
is used for gene transcription. In the earlier study, the S. aureus polC gene was 
precisely cloned at the 5' end encoding the N-terminus, but the amount of the gene 
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that remained past the 3' end was not disclosed and the procedure for subcloning the 
gene into the expression vector was only briefly summarized. Furthermore, the 
previous study does not show the level of expression of the S. aureus Pol Ul-L, nor the 
amount of S. aureus Pol IT1-L that is obtained from the induced cells. Since the 
5 previously published procedure could not be repeated and the efficiency of the 

expression vector could not be assessed, another strategy outlined below had to be 
developed. 

The isolated polC gene was cloned into a vector that utilizes T7 RNA 
polymerase for transcription as this process generally expresses a large amount of 

10 protein. Hence, the S. aureus polC gene was cloned precisely into the start codon at 

the Ndel site downstream of the T7 promotor in a pET vector . As the polC gene 
contains an internal Ndel site, the entire gene could not be amplified and placed it into 
the Ndel site of a pET vector. Hence, a three step cloning strategy that yielded the 
desired clone was devised (Figure 1). These attempts were quite frustrating initially 

15 as no products of cloning in standard E. coli strains such as DH5ct, a typical 

laboratory strain for preparation of DNA, could be obtained. Finally, a cell that was 
mutated in several genes affecting DNA stability was useful in obtaining the desired 
products of cloning. 

In brief, the cloning strategy required use of another expression vector 

20 (called pETl 137kDa) in which the 37 kDa subunit of human RFC, the clamp loader 

of the human replication system, had been cloned into the pETl 1 vector. The gene 
encoding the 37kDa subunit contains an internal Nsil site, which was needed for the 
precise cloning of the isolated polC gene. This three step strategy is shown in 
Figure 1 . In the first step, an approximately 2.3 kb section of the 5' section of the gene 

25 (encoding the N-terminus of Pol HI-L) was amplified using the polymerase chain 

reaction (PCR). Primers were as follows: 

Upstream (SEQ. ID. No. 35) 

ggtggtaatt gtcttg cata tcjacagagc 29 

30 



Downstream (SEQ. ID. No. 36) 
agcgattaag tggattgccg ggttgtgatg c 



31 
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Amplification was performed using 500 ng genomic DNA, 0.5 mM EDTA, 1 of 
each primer, ImM MgSCM, 2 units vent DNA polymerase (New England Biolabs) in 
100 of vent buffer (New England Biolabs). Forty cycles were performed using the 
following cycling scheme: 94°C, 1 min; 60°C, 1 min.; 72°C, 2.5 min. The product 
was digested with Ndel (underlined in the upstream primer) and Nsil (an internal site 
in the product) and the approximately 1 .8 kb fragment was gel purified. A pETl 1 
vector containing as an insert the 37 kDa subunit of human replication factor C 
(pETl 137kDa) was digested with Ndel and Nsil and gel purified. The PCR fragment 
was ligated into the digested pETl 137kDa vector and the ligation reaction was 
transformed into Epicurean coli supercompetent SURE 2 cells (Stratagene) and 
colonies were screened for the correct chimera (pETl lPolCl) by examining 
minipreps for proper length and correct digestion products using Ndel and Nsil. In the 
second step, an approximately 2076 bp fragment containing the DNA encoding the C- 
terminus of Pol ID-L subunit was amplified using the following sequences as primers: 

Upstream (SEQ. ID. No. 37) 

agcatcacaa cccggcaatc cacttaatcg c 31 
Downstream (SEQ. ID. No. 38) 

gactacg cca tcjggcattaa ataaatacc 29 

The amplification cycling scheme was as described above except the elongation step 
at 72°C was for 2 min. The product was digested with BamHI (underlined in the 
downstream primer) and Nsil (internal to the product) and the approximately 480 bp 
product was gel purified and ligated into the pETl lPolCl that had been digested with 
Nsil/BamHI and gel purified (ligated product is pETl lPolC2). To complete the 
expression vector, an approximately 2080 bp PCR product was amplified over the two 
Nsil sites internal to the gene using the following primers: 



Upstream (SEQ. ID. No. 39) 




30 



Downstream (SEQ. ID. No. 40) 




34 
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The amplification cycling scheme was as described above except the 72°C elongation 
step was 2 min. The PCR product, and the pETl 1Po1C2 vector, were digested with 
Nsil and gel purified. The ligation mixture was transformed as described above and 
colonies were screened for the correct chimera (pETl lPolC). 

To express Pol ITJ-L polymerase, the pETl lPolC plasmid was 
transformed into E. coli strain BL21(DE3). 24 L of E. coli BL21(DE3)pETl lPolC 
were grown in LB media containing 50 ug/ml ampicillin at 37°C to an OD of 0.7 and 
then the temperature was lowered to I5°C. Cells were then induced for Pol m-L 
expression upon addition of 1 mM IPTG to produce the T7 RNA polymerase needed 
to transcribe polC. This step was followed by further incubation at 15°C for 18 h. 
Expression of S. aureus Pol ITJ-L polymerase was so high that it could easily be 
visualized by Coomassie staining of a SDS polyacrylamide gel of whole cells 
(Figure 2A). The expressed protein migrated in the SDS polyacrylamide gel in a 
position expected for a 1 65 kDa polypeptide. In this procedure, it is important that 
cells are induced at 15°C, as induction at 37°C produces a truncated version of Pol IH- 
L polymerase, of approximately 130 kDa. 

Cells were collected by centrifugation at 5°C. Cells (12 g wet weight) 
were stored at -70°C. The following steps were performed at 4°C. Cells were thawed 
and lysed in cell lysis buffer as described (final volume = 50 ml) and were passed 
through a French Press (Amico) at a minimum of 20,000 psi. PMSF (2 mM) was 
added to the lysate as the lysate was collected from the French Press. DNA was 
removed and the lysate was clarified by centrifugation. The supernatent was dialyzed 
for 1 h against Buffer A containing 50 mM NaCl. The final conductivity was 
equivalent to 190 mM NaCl. Supernatent (24 ml, 208 mg) was diluted to 50 ml using 
Buffer A to bring the conductivity to 96 mM MgCl 2 , and then was loaded onto an 8 
ml MonoQ column equilibrated in Buffer A containing 50 mM NaCl. The column 
was eluted with a 160 ml linear gradient of Buffer A from 50 mM NaCl to 500 mM 
NaCl. Seventy five fractions (1.3 ml each) were collected (Figure 2B). Aliquots were 
analyzed for their ability to synthesize DNA, and 20 \x\ of each fraction was analyzed 
by Coomassie staining of an SDS polyacrylamide gel. Based on the DNA synthetic 
capability, and the correct size band in the gel, fractions 56-65 containing Pol IU-L 
polymerase were pooled (22 ml, 31 mg). The pooled fractions were dialyzed 
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overnight at 4°C against 50 mM phosphate (pH 7.6), 5 mM DTT, 0.1 mM EDTA, 2 
mM PMSF, and 20 % glycerol (P-cell buffer). The dialyzed pool was loaded onto a 
4.5 ml phosphocellulose column equilibrated in P-cell buffer, and then eluted with a 
25 ml linear gradient of P-cell buffer from 0 M NaCl to 0.5 M NaCl. Fractions of 1 
5 ml were collected and analyzed in a SDS polyacrylamide gel stained with Coomassie 

Blue (Figure 2C). Fractions 20-36 contained the majority of the Pol IH-large at a 
purity of greater than 90 % (5 mg). 

Example 7 - S. aureus Pol III-L is Not Processive on its Own 

10 

The Pol ITJ-L polymerase purifies from B. subtilis as a single subunit 
without accessory factors (Barnes et al., "Purification of DNA Polymerase m of 
Gram-positive Bacteria," Methods in Enzv. , 262:35-42 (1995), which is hereby 
incorporated by reference). Hence, it seemed possible that it may be a Type I 

15 replicase (e.g., like T5 polymerase) and, thus, be capable of extending a single primer 

full length around a long singly primed template. To perform this experiment, a 
template M13mpl8 ssDNA primed with a single DNA oligonucleotide either in the 
presence or absence of SSB was used. DNA products were analyzed in a neutral 
agarose gel which resolved products by size. The results showed that Pol Ul-L 

20 polymerase was incapable of extending the primer around the DNA (to form a 

completed duplex circle referred to as replicative form 11 ("RFU")) whether SSB was 
present or not. This experiment has been repeated using more enzyme and longer 
times, but no full length RFII products are produced. Hence, Pol HI-L would appear 
not to follow the paradigm of the T5 system (Type I replicase) in which the 

25 polymerase is efficient in synthesis in the absence of any other protein(s). 

Example 8 - Cloning and Purification of S. aureus Beta Subunit 

The sequence of an S. aureus homolog of the E. coli dnaN gene 
30 (encoding the beta subunit) was obtained in a study in which the large recF region of 

DNA was sequenced (Alonso et al., "Nucleotide Sequence of the recF Gene Cluster 
From Staphylococcus aureus and Complementation Analysis in Bacillus subtilis recF 
Mutants," Mol. Gen. Genet. , 246:680-686 (1995), Alonso et al., "Nucleotide 
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Sequence of the recF Gene Cluster From Staphylococcus aureus and 
Complementation Analysis in Bacillus subtilis recF Mutants," Mol. Gen. Genet. . 
248:635-636 (1995), which are hereby incorporated by reference). Sequence 
alignment of the S. aureus beta and E. coli beta show approximately 30% identity. 
Overall this level of homology is low and makes it uncertain that S. aureus beta will 
have the same shape and function as the E. coli beta subunit. 

To obtain S. aureus beta protein, the dnaN gene was isolated and 
precisely cloned into a pET vector for expression in E. coli. S. aureus genomic DNA 
was used as template to amplify the homolog of the dnaN gene (encoding the putative 
beta). The upstream and downstream primers were designed to isolate the dnaN gene 
by PCR amplification from genomic DNA. Primers were: 

Upstream (SEQ. ID. No. 41) 

cgactggaag gagttttaac atatg atgga attcac 36 
Downstream (SEQ. ID. No. 42) 

ttatat ggat ccttagtaag ttctgattgg 30 

The Ndel site used for cloning into pET16b (Novagen) is underlined in the Upstream 
primer and the BamHI site used for cloning into pET16b is underlined in the 
Downstream primer. The Ndel and BamHI sites were used for directional cloning 
into pET16 (Figure 3). Amplification was performed using 500 ng genomic DNA, 0.5 
mM dNTPs, 1 nM of each primer, ImM MgSC>4, 2 units vent DNA polymerase in 100 
ul of vent buffer. Forty cycles were performed using the following cycling scheme: 
94°C, 1 min; 60°C, 1 min.; 72°C, 1 min. 10s. The 1 167 bp product was digested with 
Ndel and BamHI and purified in a 0.7 % agarose gel. The pure digested fragment was 
ligated into the pET16b vector which had been digested with Ndel and BamHI and gel 
purified in a 0.7% agarose gel. Ligated products were transformed into E. coli 
competent SURE D cells (Stratagene) and colonies were screened for the correct 
chimera by examining minipreps for proper length and correct digestion products 
using Ndel and BamHI. 

24 L of of BL21(DE3)pETbeta cells were grown in LB containing 50 
ng/ml ampicillin at 37°C to an O.D. of 0.7, and, then, the temperature was lowered to 
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1 5°C. IPTG was added to a concentration of 2 mM and after a further 1 8 h at 1 5°C to 
induce expression of S. aureus beta (Figure 4 A). It is interesting to note that the beta 
subunit, when induced at 37°C, was completely insoluble. However, induction of 
cells at 15°C provided strong expression of beta and, upon cell lysis, over 50% of the 
beta was present in the soluble fraction. 

Cells were harvested by centrifugation (44 g wet weight) and stored at - 
70°C. The following steps were performed at 4°C. Cells (44 g wet weight) were 
thawed and resuspended in 45 ml IX binding buffer (5 mM imidizole, 0.5 M NaCl, 20 
mM Tris HC1 (final pH 7.5)) using a dounce homogenizer. Cells were lysed using a 
French Pressure cell (Aminco) at 20,000 psi, and then 4.5 ml of 10 % polyamine P 
(Sigma) was added. Cell debris and DNA was removed by centrifugation at 13,000 
rpm for 30 min. at 4°C. The pET16beta vector places a 20 residue leader containing 
10 histidine residues at the N-terminus of beta. Hence, upon lysing the cells, the 
S. aureus beta was greatly purified by chromatography on a nickel chelate resin 
(Figure 4B). The supernatant (890 mg protein) was applied to a 10 ml HiTrap 
Chelating Separose column (Pharmacia-LKB) equilibrated in binding buffer. The 
column was washed with binding buffer, then eluted with a 100 ml linear gradient of 
60 mM imidazole to 1 M imidazole in binding buffer. Fractions of 1.35 ml were 
collected. Fractions were analyzed for the presence of beta in an SDS polyacrylamide 
gel stained with Coomassie Blue. Fractions 28-52, containing most of the beta 
subunit, were pooled (35 ml, 82 mg). Remaining contaminating protein was removed 
by chromatography on MonoQ. The S. aureus beta becomes insoluble as the ionic 
strength is lowered and, thus, the pool of beta was dialyzed overnight against Buffer A 
containing 400 mM NaCl. The dialyzed pool became slightly turbid indicating it was 
at its solubility limit at these concentrations of protein and NaCl. The insoluble 
materia] was removed by centrifugation (64 mg remaining) and, then, diluted 2-fold 
with Buffer A to bring the conductivity to 256. The protein was then applied to an 8 
ml MonoQ column equilibrated in Buffer A plus 250 mM NaCl and then eluted with a 
100 ml linear gradient of Buffer A from 0.25M NaCl to 0.75 M NaCl; fractions of 
1.25 ml were collected (Figure 4C). Under these conditions, approximately 27 mg of 
the beta flowed through the column and the remainder eluted in fractions 1-18 
(24 mg). 
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Example 9 - The S. aureus Beta Subunit Protein Stimulates S. aureus Pol HI-L 
and E. coli Core 

The experiment of Figure 5A, tests the ability of S. aureus beta to 
stimulate S. aureus Pol ITJ-L on a linear polydA-oligodT template. Reactions are also 
performed with E. coli beta and Pol m core. The linear template was polydA of 
average length of 4500 nucleotides primed with a 30mer oligonucleotide of T 
residues. The first two lanes show the activity of Pol IH-L either without (lane 1) or 
with S. aureus beta (lane 2). The result shows that the S. aureus beta stimulates Pol 
II1-L approximately 5-6 fold. Lanes 5 and 6 show the corresponding experiment using 
E. coli core with (lane 6) or without (lane 5) E. coli beta. The core is stimulated over 
1 0-fold by the E. coli beta subunit under the conditions used. 

Although Gram positive and Gram negative cells diverged from one 
another long ago and components of one polymerase machinery would not be 
expected to be interchangable, it was decided to test the activity of the S. aureus beta 
with E. coli Pol EI core. Lanes 3 and 4 shows that the S. aureus beta also stimulates 
E. coli core about 5-fold. This result can be explained by an interaction between the 
clamp and the polymerase that has been conserved during the evolutionary divergence 
of gram positive and gram negative cells. A chemical inhibitor that would disrupt this 
interaction would be predicted to have a broad spectrum of antibiotic activity, shutting 
down replication in Gram negative and Gram positive cells alike. This assay, and 
others based on this interaction, can be devised to screen chemicals for such 
inhibition. Further, since all the proteins in this assay are highly overexpressed 
through recombinant techniques, sufficient quantities of the protein reagents can be 
obtained for screening hundreds of thousands of compounds. 

In summary, the results show that S. aureus beta, produced in E. coli, is 
indeed an active protein (i.e., it stimulates polymerase activity). Furthermore, the 
results shows that Pol ITI-L functions with a second protein (i.e., S. aureus beta). 
Before this experiment, there was no assurance that Pol ELL, which is significantly 
different in structure from E. coli alpha, would function with another protein. For 
example, unlike E. coli alpha, which copurifies with several accessory proteins, Pol 
HI-L purified from B. subtilis as a single protein with no other subunits attached 
(Barnes et al., "Purification of DNA Polymerase ID of Gram-positive Bacteria," 
Methods in Enzv. , 262:35-42 (1995), which is hereby incorporated by reference). 
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Finally, if one were to assume that S. aureus beta would function with a polymerase, 
the logical candidate would have been the product of the dnaE gene (alpha-small) 
instead ofpolC (Pol ITJ-L) since the dnaE product is more homologous to E. coli alpha 
subunit than Pol ITI-L. 

Example 10 - The S. aureus Beta Subunit Behaves as a Circular Sliding Clamp 

The ability of S. aureus beta to stimulate Pol IH-L could be explained 
by formation of a 2-protein complex between Pol ITJ-L and beta to form a processive 
replicase similar to the Type n class (e.g., T7 type). Alternatively, the S. aureus 
replicase is organized as the Type HI replicase which operates with a circular sliding 
clamp and a clamp loader. In this case, the S. aureus beta would be a circular protein 
and would require a clamp loading apparatus to load it onto DNA. The ability of the 
beta subunit to stimulate Pol HI-L in Figure 5A could be explained by the fact that the 
polydA-oligodT template is a linear DNA and a circular protein could thread itself 
onto the DNA over an end. Such "end threading" has been observed with PCNA and 
explains its ability to stimulate DNA polymerase delta in the absence of the RFC 
clamp loader (Burgers et al., "ATP-Independent Loading of the Proliferating Cell 
Nuclear Antigen Requires DNA Ends," J. Biol. Chem. . 268:19923-19926 (1993), 
which is hereby incorporated by reference). 

To distinguish between these possibilities, S. aureus beta was 
examined for ability to stimulate Pol UI-L on a circular primed template. In 
Figure 5B, assays were performed using circular Ml 3mp 18 ssDNA coated with 
E. coli SSB and primed with a single oligonucleotide to test the activity of beta on 
circular DNA. Lane 1 shows the extent of DNA synthesis using Pol HI-L alone. In 
lane 2, Pol Hl-L was supplemented with S. aureus beta. The S. aureus beta did not 
stimulate the activity of Pol IH-L on this circular DNA (nor in the absence of SSB). 
Inability of S. aureus beta to stimulate Pol HI-L is supported by the results of Figure 6, 
lane 1 that analyzes the product of Pol ITJ-L action on the circular DNA in an agarose 
gel in the presence of S. aureus beta. In summary, these results show that S. aureus 
beta only stimulates Pol ITJ-L on linear DNA, not circular DNA. Hence, the S. aureus 
beta subunit behaves as a circular protein. 
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Lane 3 shows the result of adding both 5. aureus beta and E. coli 
gamma complex to Pol Ill-L. Again, no stimulation was observed (compare with lane 
1 ). This result indicates that the functional contacts between the clamp and clamp 
loader were not conserved during evolution of Gram positive and Gram negative cells. 
5 Controls for these reactions on circular DNA are shown for the E. coli 

system in Lanes 4-6. Addition of only beta to E. coli Pol UJ core did not result in 
stimulating the polymerase (compare lanes 4 and 5). However, when clamp loader 
complex was included with beta and core, a large stimulation of synthesis was 
observed (lane 6). In summary, stimulation of synthesis is only observed when both 
10 beta and clamp loader complex were present, consistent with inability of the circular 

beta ring to assemble onto circular DNA by itself. 

Example 11 - Pol III-L Functions as a Pol Ill-Type Replicase with Beta and a 
Clamp Loader Complex to Become Processive 

15 

Next, it was determined whether S. aureus Pol ITJ-L requires two 
components (a beta clamp and a clamp loader) to extend a primer full length around a 
circular primed template. In Figure 6, a template circular M13mpl 8 ssDNA primed 
with a single DNA oligonucleotide was used. DNA products were analyzed in a 

20 neutral agarose gel which resolves starting materials (labeled ssDNA in Figure 6) 

from completed duplex circles (labelled RFII for replicative form II). The first two 
lanes show, as demonstrated in other examples, that Pol ITJ-L is incapable of 
extending the primer around the circular DNA in the presence of only S. aureus beta. 
In lane 4 of Figure 6, E. coli clamp loader complex (also known as gamma complex) 

25 and beta subunit were mixed with S. aureus Pol m-L in the assay containing singly 

primed M13mpl 8 ssDNA coated with SSB. If the beta clamp, assembled on DNA by 
clamp loader complex, provides processivity to S. aureus Pol ITJ-L, the ssDNA circle 
should be converted into a fully duplex circle (RFII) which would be visible in an 
agarose gel analysis. The results of the experiment showed that the E. coli beta and 

30 clamp loader complex did indeed provide Pol ITJ-L with ability to fully extend the 

primer around the circular DNA to form the RFII (lane 4). The negative control using 
only E. coli clamp loader complex and beta is shown in lane 3. For comparison, lane 
6 shows the result of mixing the three components of the E. coli system (Pol ITJ core, 
beta, and clamp loader complex). This reaction gives almost exclusively full length 



WO 01/09164 



-97- 



PCT/USOO/20666 



RFD product. The qualitatively different product profile that Pol ITJ-L gives in the 
agarose gel analysis compared to E. coli Pol ID core with beta and clamp loader 
complex shows that the products observed using Pol Hl-L is not due to a contaminant 
of E. coli Pol HI core in the S. aureus Pol IH-L preparation (compare lanes 4 and 6). 

It is generally thought that the polymerase of one system is specific for 
its SSB. However, these reactions are performed on ssDNA coated with the E. coli 
SSB protein. Hence, the S. aureus Pol IH-L appears capable of utilizing E. coli SSB 
and the E. coli beta. It would appear that the only component that is not 
interchangeable between the Gram positive and Gram negative systems is the clamp 
loader complex. 

Thus, the S. aureus Pol HI-L functions as a Pol ID type replicase with 
the E. coli beta clamp assembled onto DNA by a clamp loader complex. 

Example 12 - Purification of Two DNA Polymerase Ill-Type Enzymes From 
S. aureus Cells 

The MonoQ resin by Pharmacia has very high resolution which would 
resolve the three DNA polymerases of S. aureus. Hence, S. aureus cells were lysed, 
DNA was removed from the lysate, and the clarified lysate was applied onto a MonoQ 
column. The details of this procedure are: 300 L of S. aureus (strain 4220, a gift of 
Dr. Pat Schlievert, University of Minnisota) was grown in 2X LB media at 37°C to an 
O.D. of approximately 1.5 and then were collected by centrifugation. Approximately 
2 kg of wet cell paste was obtained and stored at -70°C. 122 g of cell paste was 
thawed and resuspended in 192 ml of cell lysis buffer followed by passage through a 
French Press cell (Aminco) at 40,000 psi. The resultant lysate was clarified by high 
speed centrifugation (1.3 g protein in 120 ml). A 20 ml aliquot of the supernatant was 
dialyzed 2 h against 2 L of buffer A containing 50 mM NaCl. The dialyzed material 
(148 mg, conductivity = 101 mM NaCI) was diluted 2-fold with Buffer A containing 
50 mM NaCl and then loaded onto an 8 ml MonoQ column equilibrated in Buffer A 
containing 50 mM NaCl. The column was washed with Buffer A containing 50 mM 
NaCl, and then eluted with a 160 ml linear gradient of 0.05 M NaCl to 0.5 M NaCl in 
Buffer A. Fractions of 2.5 ml (64 total) were collected, followed by analysis in an 
SDS polyacrylamide gel for their replication activity in assays using calf thymus 
DNA. 
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Three peaks of DNA polymerase activity were identified (Figure 7). 
Previous studies of cell extracts prepared from the Gram positive organism Bacillus 
subtilis identified only two peaks of activity off a DEAE column (similar charged 
resin to MonoQ). The first peak was Pol n, and the second peak was a combination of 
DNA polymerases I and HI. The DNA polymerases I and III were then separated on a 
subsequent phosphocellulose column. The middle peak in Figure 7 is much larger 
than the other two peaks and, thus, it was decided to chromatograph this peak on a 
phosphocellulose column. The second peak of DNA synthetic activity was pooled 
(fractions 37-43; 28 mg in 14 ml) and dialyzed against 1.5 L P-cell buffer for 2.5 h. 
Then, the sample (ionic strength equal to 99 mM NaCl) was applied to a 5 ml 
phosphocellulose column equilibrated in P-cell buffer. After washing the column in 
10 ml P-cell buffer, the column was eluted with a 60 ml gradient of 0 - 0.5 M NaCl in 
P-cell buffer. Seventy fractions were collected and then analyzed for DNA synthesis 
using calf thymus DNA as template. This column resolved the polymerase activity 
into two distinct peaks (Figure 7B). 

Hence, there appear to be four DNA polymerases in Staphylococcus 
aureus. They were designated here as peak 1 (first peak off MonoQ), peak 2 (first 
peak off phosphocellulose), peak 3 (second peak of phosphocellulose), and peak 4 
(last peak off Mono Q) (see Figure 7). Peak 4 was presumably Pol ffl-L, as it elutes 
from MonoQ in a similar position as the Pol IH-L expressed in E. coli (compare 
Figure 7A with Figure 2). 

Example 13 - Demonstration That Peak 1 (Pol III-2) Functions as a Pol Ill-Type 
Replicase With E. coli Beta Assembled on DNA by E. coli Clamp 
Loader Complex. 

To test which peak contained a Pol UJ-type of polymerase, an assay 
was used in which the E. coli clamp loader complex and beta support formation of full 
length RFII product starting from E. coli SSB coated circular Ml 3mp 18 ssDNA 
primed with a single oligonucleotide. In Figure 8, both Peaks 1 and 2 are stimulated 
by the E. coli clamp loader complex and beta subunit and, in fact, Peaks 2 and 3 are 
inhibited by these proteins (the quantitation is shown below the gel in the figure). 
Further, the product analysis in the agarose gel shows full length RFII duplex DNA 
circles only for peaks 1 and 4. These results, combined with the NEM, pCMB, and 
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KC1 characteristics in Tables 2 and 3 below, suggest that there are two Pol Ill-type 
DNA polymerases in S. aureus and that these are partially purified in peaks 1 and 4. 

Next, it was determined which of these peaks of DNA polymerase 
activity correspond to DNA polymerases I, D, and IE, and which peak is the 
unidentified DNA polymerase. In the Gram postive bacterium B. subtilis, Pol HI is 
inhibited by pCMB, NEM, and 0.15 M NaCl, Pol U is inhibited by KC1, but not NEM 
or 0.1 5 M KCL, and Pol I is not inhibited by any of these treatments (Gass et al., 
"Further Genetic and Enzymological Characterization of the Three Bacillus subtilis 
Deoxyribonucleic Acid Polymerases," J. Biol. Chem. , 248:7688-7700 (1973), which 
is hereby incorporated by reference). Hence, assays were performed in the presence or 
absence of pCMB, NEM, and 0.15 M KC1 (see Tables 2 and 3 below). Peak 3 clearly 
corresponded to Pol I, because it was not inhibited by NEM, pCMB, or 0. 1 5 M NaCl. 
Peak 2 correspond to Pol II, because it was not inhibited by NEM, but was inhibited 
by pCMB and 0.15 M NaCl. Peaks 1 and 4 both had characteristics that mimic Pol 
HI; however, peak 4 elutes on MonoQ at a similar position as Pol Ui-L expressed in E. 
coli (see Figure 2B). Hence, peak 4 is likely Pol UJ-L, and peak 1 is likely the 
unknown polymerase. 



Table 2: Expected Characteristics of Polymerases 



Polymerase pCMB 


NEM 


0.15MKC1 


Pol I not inhibited* 


not inhibited 


not inhibited 


Pol II inhibited** 


not inhibited 


not inhibited 


Pol IIJ-L inhibited 


inhibited 


not inhibited 


* Not inhibited is defined as greater than 75% remaining activity 




** Inhibited is defined as less than 40% remaining activity 




Table 3: Observed Characteristics 






Peak pCMB 


NEM 


0.1 5M KCL assignment 


Peak I inhibited 


inhibited 


new polymerase 


Peak2 inhibited 


not inhibited 


PolE 


Peak3 not inhibited 


not inhibited 


Poll 


Peak4 inhibited 


inhibited 


Pol UJ-L 
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Example 14 - Identification and Cloning of S. aureus dnaE 

This invention describes the finding of two DNA polymerases that 
function with a sliding clamp assembled onto DNA by a clamp loader. One of these 
DNA polymerases is likely Pol IH-L, but the other has not been identified previously. 
Presumably, the chromatographic resins used in earlier studies did not have the 
resolving power to separate the enzyme from other polymerases. This would be 
compounded by the low activity of Pol IH-2. To identify a gene encoding the second 
Pol m, the amino acid sequences of the Pol III alpha subunit of Escherichia coli, 
Salmonella typhimurium, Vibrio cholerae, Haemophilis influenzae, and Helicobacter 
pylori were aligned using Clustal W (1.5). Two regions about 400 residues apart were 
conserved and primers were designed for the following amino acid sequences: 

Upstream, corresponding in E. coli to residues 385-399 (SEQ. ID. No. 43) 

Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro 
15 10 15 

Downstream, corresponding in E. coli to residues 750-764 (SEQ. ID. No. 44) 

Lys Phe Ala Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr 
15 10 15 

The following primers were designed to these two peptide regions using codon 
preferences for S. aureus: 

Upstream (SEQ. ID. No. 45) 

cttctttttg aaagatttct aaataaagaa cgttattcaa tgcc 44 
Downstream (SEQ. ID. No. 46) 

ataagctgca gcatgacttt tattaaaacc ataacctgca aattt 45 

Amplification was performed using 2.5 units of Taq DNA Polymerase (Gibco, BRL), 
100 ng S. aureus genomic DNA, 1 mM of each of the four dNTPs, 1 uM of each 
primer, and 3 mM MgCb in 100 |al of Taq buffer. Thirty-five cycles of the following 
scheme were repeated: 94°C, 1 min; 55°C, 1 min; 72°C, 90 sec. The PCR product 
(approximately 1.1 kb) was electrophoresed in a 0.8 % agarose gel and purified using 
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a Geneclean III kit (Bio 101). The product was then divided equally into ten separate 
aliquots and used as a template for PCR reactions, according to the above protocol, to 
reamplify the fragment for sequencing. The final PCR product was purified using a 
Quiagen Quiaquick PCR Purification kit, quantitated via optical density at 260 nM, 
5 and sequenced by the Protein/DNA Technology Center at Rockefeller University. The 

same primers used for PCR were used to prime the sequencing reactions. 

Next, the following additional PCR primers were designed to obtain 
more sequence information 3' to the first amplified section. 

10 Upstream (SEQ. ID. No. 47) 

agttaaaaat gccatatttt gacgtgtttt agttctaat 39 

Downstream (SEQ. ID. No. 48) 

cttgcaaaag cggttgctaa agatgttgga cgaattatgg gg 42 

15 

These primers were used in a PCR reaction using 2.5 units ofTaq DNA Polymerase 
(Gibco, BRL) with 100 ng S. aureus genomic DNA as a template, ImM dNTP's, 
1 pM of each primer, and 3 mM MgCl 2 in 100 1 of Tag buffer. Thirty-five cycles of 
the following scheme were repeated: 94°C, 1 min; 55°C, 1 min; 72°C, 2 min 30 

20 seconds. The 1.6 Kb product was then divided into 5 aliquots, and used as a template 

in a set of 5 PCR reactions, as described above, to amplify the product for sequencing. 
The products of these reactions were purified using a Qiagen Qiaquick PCR 
Purification kit, quantitated via optical density at 260 nm, and sequenced by the 
Protein/DNA Technology Center at Rockefeller University. The sequence of this 

25 product yielded about 740 bp of new sequence 3' of the first sequence. 

As this gene shows better homology to the Gram negative Pol ITJ a 
subunit compared to Gram positive Pol III-L, it will be designated the dnaE gene. 

Example 15 - Identification and Cloning of S. aureus dnaX 

30 

The fact that the S. aureus beta stimulates Pol IH-L and has a ring 
shape suggests that the Gram postive replication machinery is of the three component 
type. This implies the presence of a clamp loader complex. This is not a simple 
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determination to make as the B. subtilis genome shows homologs to only two of the 
five subunits of the E. coli clamp loader {dnaX encoding gamma, and holB encoding 
delta prime). On the basis of the experiments in this application, which suggests that 
there is a clamp loader, it was believed that these two subunit homologues are part of 
the clamp loader for the S. aureus beta. 

As a start in obtaining the clamp loading apparatus, a strategy was 
devised to obtain the gene encoding the tau subunit of S. aureus. In E. coli, the tau 
and gamma subunits are derived from the same gene. Tau is the full length product, 
and gamma is about 2/3 the length of tau. Gamma is derived from the dnaX gene by 
what was originally believed to be an efficient translational frameshift mechanism 
that, after it occurs, incorporates only one unique C-terminal residue before 
encountering a stop codon. To identify the dnaX gene of S. aureus by PCR analysis, 
the dnaX genes of B. subtilis, E. coli, and //. influenzae were aligned. Upon 
comparison of the amino acid sequence encoded by these dnaX genes, two areas of 
high homology were used to predict the amino acid sequence of the S. aureus dnaX 
gene product. PCR primers were designed to these sequences, and a PCR product of 
the expected size was indeed produced. DNA primers were designed to two regions 
of high similarity for use in PCR that were about 100 residues apart. The amino acid 
sequences of these regions were: 

Upstream, corresponding to residues 39-48 of E. coli (SEQ. ID. No. 49) 

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 
1 5 10 

Downstream, corresponding to residues 138-148 of E. coli (SEQ. ID. No. 50) 
His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 
15 10 

The DNA sequence of the PCR primers was based upon the codon usage of S. aureus. 
The primers are as follows: 



Upstream (SEQ. ID. No. 51) 

cgc ggatcc c atgcatattt attttcaggt ccaagagg 



38 
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Downstream (SEQ. ID. No. 52) 

ccg gaattc t ggtggttctt ctaatgtttt taataatgc 39 

The first 9 nucleotides of the upstream primer (SEQ. ID. No. 51) contain a BamHI 
5 site, which is underlined, and do not correspond to amino acid codons; the 3' 29 

nucleotides correspond to the amino acid sequence of SEQ. ID. No. 49. The EcoRI 
site of the downstream primer (SEQ. ID. No. 52) is underlined and the 3' 33 
nucleotides correspond to the amino acid sequence of SEQ. ID. No. 50. 

The expected PCR product, based on the alignment, is approximately 

10 268 bp between the primer sequences. Amplification was performed using 500 ng 

genomic DNA, 0.5 mM dNTPs, 1 nM of each primer, 1 mM MgS04, 2 units vent 
DNA polymerase in 100 pi of vent buffer. Forty cycles were performed using the 
following cycling scheme: 94°C, 1 min; 60°C, 1 min.; 72°C, 30s. The approximately 
300 bp product was digested with EcoRI and BamHI and purified in a 0.7 % agarose 

15 gel. The pure digested fragment was ligated into pUCl 8 which had been digested 

with EcoRI and BamHI and gel purified in a 0.7 % agarose gel. Ligated products 
were transformed into E. coli competent DH5a cells (Stratagene), and colonies were 
screened for the correct chimera by examining minipreps for proper length and correct 
digestion products using EcoRI and BamHI. The sequence of the insert was 

20 determined and was found to have high homology to the dnaX genes of several 

bacteria. This sequence was used to design circular PCR primers. Two new primers 
were designed for circular PCR based on this sequence. 

A circular PCR product of approximately 1 .6 kb was obtained from a 
HincII digest of chromosomal DNA that was recircularized with ligase. This first 

25 circular PCR yielded most of the remaining dnaX gene. The two primers were as 

follows: 

Rightward (SEP. ID. No. 53) 

tttgtaaagg cattacgcag gggactaatt cagatgtg 38 

30 



Leftward (SEQ. ID. No. 54) 

tatgacattc attacaaggt tctccatcag tgc 



33 
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Genomic DNA (3 ng) was digested with Hindi, punfied with phenol/chloroform 
extraction, ethanol precipitated and redissolved in 70 »l I.E. buffer. The genomic 
DNA was recircularized upon adding 4000 units T4 ligase (New England Biolabs) in 
a final volume of 100 nl T4 ligase buffer (New England Biolabs) at 16°C overnight. 
The PCR reaction consisted of 90 ng recircularized genomic DNA, 0.5 mM each 
dNTP, 100 pmol of each primer, 1.4 mM magnesium sulfate, and 1 unit of elongase 
(GIBCO) in a final volume of 100 pi elongase buffer (GIBCO). 40 cycles were 
performed using the following scheme: 94°C, 1 min.; 55°C, 1 min.; and 68°C, 2 min. 
The resulting PCR product was approximately 1 .6 kb. The PCR product was purified 
from a 0.7 % agarose gel and sequenced directly. A stretch of approximately 750 
nucleotides was obtained using the rightward primer used in the circular PCR 
reaction. To obtain the rest of the sequence, other sequencing primers were designed 
in succession based on the information of each new sequencing run. 

This sequence, when spliced together with the previous 300 bp PCR 
sequence, contained the complete N-terminus of the gene product (stop codons are 
present upstream) and possibly lacked only about 50 residues of the C-terminus. The 
amino terminal region of E. coli tau shares what appears to be the most conserved 
region of the gene as this area shares homology with RFC subunit of the human clamp 
loader and with the gene 44 protein of the phage T4 clamp loader. An alignment of 
the N-terminal region of the S. aureus tau protein with that of B. subtilis and E. coli is 
shown in Figure 1 0. Among the highly conserved residues are the ATP binding site 
consensus sequence and the four cystine residues that form a Zn 2+ finger. 

After obtaining 1 kb of sequence in the 5' region of dnaX, it was 
sought to determine the remaining 3' end of the gene. Circular PCR products of 
approximately 800bps, 600bps, and 1600bps were obtained from Apo I, or Nsi I or 
Ssp I digest of chromosomal DNA that were recircularized with ligase. 

Rightward (SEQ. ED. No. 55) 

gagcactgat gaacttagaa ttagatatg 29 



Leftward (SEQ. ID. No. 56) 

gatactcagt atctttctca gatgttttat tc 



32 
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Genomic DNA (3 g) was digested with, Apo I, or Nsi I or Ssp I, purified with 
phenol/chloroform extraction, ethanol precipitated, and redissolved in 70 1 T.E. buffer. 
The genomic DNA was recircularized upon adding 4000 units of T4 ligase (New 
England Biolabs) in a final volume of 100 1 T4 ligase buffer (New England Biolabs) at 
16°C overnight. The PCR reaction consisted of 90 ng recircularized genomic DNA, 
0.5 mM each dNTP, 100 pmol of each primer, 1.4 mM magnesium sulfate, and 1 unit 
of elongase (GIBCO) in a final volume of 100 1 elongase buffer (GIBCO). 40 cycles 
were performed using the following scheme: 94°C, 1 min.; 55°C, 1 min.; 68°C, 2 min. 
The PCR products were directly cloned into pCR II TOPO vector using the TOPO TA 
cloning kit (Invitrogen Corporation) for obtaining the rest of the C terminal sequence 
of S. aureus dnaX. DNA sequencing was performed by the Rockefeller University 
sequencing facility. 

Example 16 - Identification and Cloning of S. aureus dnaB 

In E. coli, the DnaB helicase assembles with the DNA polymerase III 
holoenzyme to form a replisome assembly. The DnaB helicase also interacts directly 
with the primase to complete the machinery needed to duplicate a double helix. As a 
first step in studying how the S. aureus helicase acts with the replicase and primase, S. 
aureus was examined for presence of a dnaB gene. 

The amino acid sequences of the DnaB helicase of Escherichia coli, 
Salmonella typhimurium, Haemophilis influenzae, and Helicobacter pylori were 
aligned using Clustal W (1 .5). Two regions about 200 residues apart showed good 
homology. These peptide sequences were: 

Upstream, corresponding to residues 225-238 of E. coli DnaB (SEQ. ID. No. 57) 

Asp Leu lie lie Val Ala Ala Arg Pro Ser Met Gly Lys Thr 
1 5 10 

Downstream, corresponding to residues 435-449 of E. coli DnaB (SEQ. ID. No. 58) 

Glu He He He Gly Lys Gin Arg Asn Gly Pro He Gly Thr Val 
15 10 15 

The following primers were designed from regions which contained conserved 
sequences using codon preferences for S. aureus: 
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Upstream (SEQ. ED. No. 59) 

gaccttataa ttgtagctgc acgtccttct atgggaaaaa c 41 

5 Downstream (SEQ. ID. No. 60) 

aacattatta agtcagcatc ttgttctatt gatccagatt caacgaag 48 

A PCR reaction was carried out using 2.5 units of Taq DNA Polymerase (Gibco, 
BRL) with 100 ng. S. aureus genomic DNA as template, 1 mM dNTP's, l|aM of each 

10 primer, 3 mM MgCb in 100 |al of Taq buffer. Thirty-five cycles of the following 

scheme were repeated: 94°C, 1 min.; 55°C, 1 min.; and 72°C, 1 min. Two PCR 
products were produced, one was about 1.1 kb, and another was 0.6 kb. The smaller 
one was the size expected. The 0.6 kb product was gel purified and used as a template 
for a second round of PCR as follows. The 0.6 kb PCR product was purified from a 

15 0.8% agarose gel using a Geneclean ID kit (Bio 101) and then divided equally into 

five separate aliquots, as a template for PCR reactions. The final PCR product was 
purified using a Quiagen Quiaquick PCR Purification kit, quantitated via optical 
density at 260 nM, and sequenced by the Protein/DNA Technology Center at 
Rockefeller University. The same primers used for PCR were used to prime the 

20 sequencing reaction. The amino acid sequence was determined by translation of the 

DNA sequence in all three reading frames, and selecting the longest open reading 
frame. The PCR product contained an open reading frame over its entire length. The 
predicted amino acid sequence shares homology to the amino acid sequences encoded 
by dnaB gene of other organisms. 

25 Additional sequence information was determined using the circular 

PCR technique. Briefly, S. aureus genomic DNA was digested with various 
endonucleases, then religated with T4 DNA ligase to form circular templates. To 
perform PCR, two primers were designed from the initial sequence. 

30 First primer (SEQ. ID. No. 6 1 ) 

gatttgtagt tctggtaatg ttgactcaaa ccgcttaaga accgg 45 

Second primer (SEQ. ED. No. 62) 

atacgtgtgg ttaactgatc agcaacccat ctctagtgag aaaatacc 48 
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The first primer matches the sequence of the coding strand and the second primer 
matches the sequence of the complementary strand. These two pnmers are directed 
outwards from a central point, and allow determination of new sequence information 
5 up to the ligated endonuclease site. A PCR product of approximately 900 bases in 

length was produced using the above primers and template derived from the ligation 
of S. aureus genomic DNA which had been cut with the restriction endonuclease Apo 
I. This PCR product was electrophoresed in a 0.8% agarose gel, eluted with a Qiagen 
gel elution kit, divided into five separate aliquots, and used as a template for 

10 reamplification by PCR using the same primers as described above. The final product 

was electrophoresed in an 0.8% agarose gel, visualized via staining with ethidium 
bromide under ultraviolet light, and excised from the gel. The excised gel slice was 
frozen, and centrifuged at 12,000 rpm for 15 minutes. The supernatant was extracted 
with phenol/chloroform to remove ethidium bromide, and was then cleaned using a 

15 Qiagen PCR purification kit. The material was then quantitated from its optical 

density at 260 nm and sequenced by the Protein/DNA Technology Center at the 
Rockefeller University. 

The nucleotide sequence contained an open reading frame over its 
length, up to a sequence which corresponded to the consensus sequence of a cleavage 

20 site of the enzyme Apo I. Following this point, a second open reading frame encoded 

a different reading frame up to the end of the product. The inital sequence 
information was found to match the inital sequence and to extend it yet further 
towards the C-terminus of the protein. The second reading frame was found to end in 
a sequence which matched the 5 '-terminus of the previously determined sequence and, 

25 thus, represents an extension of the sequence towards the N-terminus of the protein. 

Additional sequence information was obtained using the above primers 
and a template generated using 5 1 . aureus genomic DNA circularized via ligation with 
T4 ligase following digestion with Cla I. The PCR product was generated using 35 
cycles of the following program: denaturation at 94°C for 1 min.; annealing at 55°C 

30 for 1 min.; and extension at 68°C for 3 minutes and 30 s. The PCR products were 

electrophoresed in a 0.8% agarose gel, eluted with a Qiagen gel elution kit, divided 
into five separate aliquots, and used as a template reamplification via PCR with the 
same primers described above. The final product was electrophoresed in an 0.8% 
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agarose gel, visualized via staining with ethidium bromide under ultraviolet light, and 
excised from the gel. The excised gel slice was frozen, and centnfuged at 12,000 rpm 
for 15 min. The supernatant was cleaned using a Qiagen PCR purification kit. The 
material was then quantitated via optical density at 260 nm and sequenced by the 
5 Protein/DNA Technology Center at Rockefeller University. The open reading frames 

continued past 500 bases. Therefore, the following additional sequencing primers 
were designed from the sequence to obtain further information: 

First primer (SEQ. ID. No. 63) 
10 cgttttaatg catgcttaga aacgatatca g 31 

Second primer (SEQ. ID. No. 64) 

cattgctaag caacgttacg gtccaacagg c 31 

15 The N-terminal and C-terminal nucleotide sequence extensions 

generated using this circular PCR product completed the 5' region of the gene 
(encoding the N-terminus of DnaB); however, a stop codon was not reached in the 3' 
region and, thus, a small amount of sequence is still needed to complete this gene. 

The alignment of the S. aureus dnaB with E. coli dnaB and the dnaB 

20 genes of B. subtilis and S. typhitnurium is shown in Figure 1 1 . 

Example 17 - Identification and Cloning of S. aureus holB 

The S. aureus holB was identified by searching the S. aureus database 
25 with the sequences of S. pyogenes 5' subunit. The S. aureus holB encodes a 253 

residue protein of about 28 kDa. The holB gene was amplified by PCR using an 
upstream 69-mer primer as follows: 

Upstream Primer (SEQ. ID. No. 65): 
30 ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatac ccatg 60 

gatg aacag 69 

which contains an Ncol site (underlined), and a downstream 39-mer primer as 
follows: 
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Downstream Primer (SEQ. ID. No. 66): 

aattttaaag gatcc gtgta taatattcta attttcccg 39 

5 which contains a BamHI site (underlined). The PCR product was digested with Ncol 

and BamHI, purified, and ligated into the Ncol and BamHI sites of pETl la to produce 
plasmid pETSaholB. 

Example 18 - Purification of S. aureus 5' 

10 

The pETSaholB plasmid of Example 17 was transformed into E. 
coli BL21(DE3)recA A single colony was used to innoculate 2L of LB media 
supplemented with 200 pg/ml ampicillin. Cells (2L) were grown at 37°C to 
OD 6 oo=0.5 at which point the temperature was lowered to 15°C and 0.5 mM FPTG 

15 was added. After 16 hr of induction, cells were collected by centrifugation, 

resuspended in 50 mM Tris-HCl (pH 7.5), 10% sucrose, 1 M NaCl, 30 mM 
spermidine, 5 mM DTT, and 2 mM EDTA. Cells were lysed by two passages through 
a French press (15,000 psi), followed by centrifugation at 13,000 rpm for 30 min at 
4°C. Ammonium sulfate (0.3 g/ml) was added to the clarified lysate. The pellet was 

20 backwashed in 30 ml buffer A containing 0. 1 M NaCl and 0.24 g/ml ammonium 

sulfate using a Dounce homogenizer, then the pellet was recovered by centrifugation. 
The resulting pellet was resuspended in 20 ml of buffer A and dialyzed against buffer 
A. The dialyzed protein was applied to a 20 ml FFQ Sepharose column equilibrated 
in buffer A and eluted with a 200 ml linear gradient of 0 - 500 mM NaCl in buffer A; 

25 80 fractions were collected. Peak fractions (54 - 75) were combined (72 mg) and 

dialyzed against buffer A. The 5' preparation was aliquoted and stored frozen at - 
80°C. 

Example 19 - Identification and Cloning of S. aureus holA 

30 

The S. aureus holA gene was identified by searching the S. aureus 
database with the sequences of E. coli and S. pyogenes 6 subunits. The S. aureus holA 
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gene encodes a 288 residue protein of about 32 kDa. The holA gene was amplified by 
PCR using an upstream 28-mer primer as follows: 

Upstream Primer (SEQ. ID. No. 67): 
5 gggagtttgt aat ccatgg a tgaacagc 28 

which contains a Ncol site (underlined), and a downstream 37-mer primer as follows: 

Downstream Primer (SEQ. ID. No. 68): 

10 ctgaacacct attac cctag gcatctaact cacaccc 37 

which contains a BamHI site (underlined). The PCR product was digested with Ncol 
and BamHI, purified, and ligated into the Ncol and BamHI sites of pETl la to produce 
plasmid pETSaholA. 

15 

Example 20 - Purification of S. aureus 5 

The pETSaholA plasmid of Example 19 was transformed into E. coli 
NovaBlue (recAI \ac[F'proA + B + lac"ZAM15::TnI0(Tc R )) (Novagen). A single 

20 colony was used to innoculate 12L of LB media supplemented with 200 (ig/ml 

ampicillin. Cells (12L) were grown at 37°C to OD6oo=0.5 at which point the 
temperature was lowered to 15°C and 0.5 mM D?TG was added. After 16 br of 
induction, cells were collected by centrifugation, resuspended in 50 mM Tris-HCl (pH 
7.5), 10% sucrose, 1M NaCl, 30 mM spermidine, 5 mM DTT, and 2 mM EDTA. 

25 Cells were lysed by two passages through a French press (15,000 psi), followed by 

centrifugation at 13,000 rpm for 30 min at 4°C. Ammonium sulfate (0.3 g/ml) was 
added to the clarified lysate. The resulting pellet was resuspended in 250 ml of buffer 
A. The dialyzed protein was applied to a 100 ml FFQ Sepharose column equilibrated 
in buffer A and eluted with a 1000 ml linear gradient of 0 - 500 mM NaCl in buffer A; 

30 80 fractions were collected. Peak fractions (40-49) were combined (65 mg) and 

dialyzed against buffer A. The dialyzed protein was applied to a 8 ml MonoQ 
Sepharose column equilibrated in buffer A and eluted with a 80 ml linear gradient of 0 
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- 500 mM NaCl in buffer A; 80 fractions were collected. Peak fractions of the 5 
preparation were stored frozen at -80°C. 

Example 21 - Consitution of a Processive S. aureus DNA Polymerase III Enzyme 
5 from Three Components 

The PolC (alpha-large) requires the p clamp for processivity, which in 
tum requires the clamp loader (t55') for assembly onto DNA. The S. aureus clamp 
loader, x88' complex, was assembled by mixing the three proteins as follows: 400 \xg 

10 of t and 80 ng each of 5 and 6' were mixed in buffer A containing no NaCl and 

preincubated at 15°C for 10 min. The mixture was injected onto a 1 ml MonoQ 
column equilibrated in buffer A, and then eluted with a 30 ml linear gradient of 0-500 
mM NaCl in buffer A; 60 fractions were collected. Fractions were analyzed in a 10% 
SDS-polyacrylamide gel stained with Coomassie Blue. Peak fractions (40-50) were 

15 combined and concentrated using a Centricon 30 concentrator. 

The ability of the three components to work together to form the 
processive Pol m was tested by determining whether t5S' and p clamp could confer 
the ability of PolC to completely extend a single primer full circle around a large 7.2 
kb circular Ml 3mp 18 ssDNA genome. Replication reaction contained 70 ng (25 

20 fmol) on singly primed Ml 3mpl 8 ssDNA, 20 ng S. aureus P, 50 ng S. aureus PolC, 

either 30 ng or 90 ng of S. aureus t68' (when indicated), and 0.82 jig of 5". pyogenes 
SSB in 24 \i\ of 20 mM Tris-HCl (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 mM DTT, 
2 mM ATP, 8 mM MgCl 2 , 40 jig/ml BSA, and 60 mM each of dGTP and dCTP. 
Reactions were pre-incubated for 2 min at 37°C to assemble protein complexes on the 

25 primer terminus. DNA synthesis was initiated upon addition of 1 .5 u.1 dATP and 32 P- 

TTP (specific activity 2,000-4,000 cpm/pmol) and synthesis was allowed to proceed 
for 1 min before being quenched with an equal volume (25 p.1) of a solution of 1% 
SDS and 40 mM EDTA. One-half of the quenched reaction was analyzed for total 
DNA synthesis using DE8 1 paper as described, and the other half was analyzed by 

30 agarose gel phoresis. An autoradiogram of the agarose gel analysis of the replication 

products is depicted in Figure 13, which shows that the presence of PolC and P, but 
absence of i56' (lane 1) gives no full length circular duplex (RFU). However, in the 
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presence of t56' (lanes 2 and 3), full length circular duplex DNA (RFII) is produced, 
as expected for the action of a processive Pol ID holozyme. 

Example 22 - General Induction/Purification Conditions for S. pyogenes 

5 

The purification protocols for S. pyogenes proteins were performed 
using following standardized conditions. Cells were grown from a single colony, 
freshly transformed overnight. Cells were grown in 200 ug/ml Ampicillin to 
OD600=0. 3-0.4, at which point cultures were chilled prior to addition of fPTG (to a 

10 final concentration of 0.5 mM) and were allowed to incubate for 16 hrs at 15°C. 

Following this, all procedures were performed at 4°C. Cell paste (1-2 g/liter of 
culture) was resuspended (10 ml/g cell paste) in 50 mM Tris-HCl (pH 7.5)/10% 
Sucrose/1 M NaCl/5 mM DTT/ 30 mM Spermidine/lX Heat lysis buffer (50 mM 
Tris-HCl (pH 7.5), 1% Sucrose, 100 mM NaCl, 2 mM EDTA). Cells were lysed by 

15 two passages through the French Press (15,000 psi) followed by centrifugation at 

14,000 rpm at 4 °C. Ammonium sulfate, when added to the cleared lysate, was added 
gradually. Precipitate was allowed to settle on ice for a minimum of 30 min prior to 
collection by centrifugation. Protein pellets were resuspended in buffer A (50 mM 
Tris-HCl pH 7.5, 1 mM EDTA, 5 mM DTT, 10% glycerol) and dialyzed for over 3 

20 hours in the same buffer. Column design is based on the manufacturer's suggested 

capacities: Fast Flow Q (FFQ) and MonoQ are 20 mg protein /ml resin, Heparin- 
Affigel agarose is 1.2 mg protein/ml resin. Elution was performed using 10 column 
volume (c.v.) gradients, and the entire gradient elution profile was collected in 80 
fractions. Unless mentioned otherwise all columns were equilibrated and eluted with 

25 buffer A. 

Example 23 - Identification of a 5. pyogenes holA gene Encoding a Functional 
Delta Subunit and Purification of the Delta Subunit 

30 Alignment of E. coli delta subunit with 1 0 other putative holA products 

from unfinished genome databases of Gram negative bacteria indicates a region of 
conserved amino acid sequence. Amino acids Q 140 to L230 of E. coli delta were 
used to search the B. subtilis genome database for a Gram positive delta homolog. 
This search revealed yqeN, a potential reading frame of unknown function, as the 
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highest scoring sequence. Although the score was low, it was treated as a candidate 
for Gram positive delta. The alignment with E. coli delta is shown in Figure 12 A. A 
Streptococcus pyogenes genome database was searched wWhyqeN. Two contigs 
which represent N- (contig 206) and C- (contig 264) termini of S. pyogenes delta 
5 subunit were identified. The alignment of the putative S. pyogenes holA with B. 

subtilis yqeN is shown in Figure 1 2B. The following primers were used to obtain 
PCR products for delta subunit: 

holA Upstream (SEQ. ID No. 69) 

10 ggagcagatt gcttttgata catatgattg gcctattc 38 

holA Downstream (SEQ. ID No. 70) 

ttgtctccgc atcaaactgg gatccaagag catcatacgc gtatgg 46 

15 These primers were used to amplify the holA gene from S. pyogenes genomic DNA. 

The PCR product was digested with Ndel and BamHI, purified and ligated into the 
pETl la vector to produce pETl la.S.p. holA. 

The pETl la.S.p. holA plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation 

20 was used to innoculate 12L LB broth supplemented with 200 ug/ml Ampicillin. Cells 

were grown at 37°C to OD600=0.5, at which point the temperature was lowered to 
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, 
cells were collected by centrifugation and resuspended in 50 mM Tris-HCl (pH 7.5)/ 
10% Sucrose /IX Heat Lysis Buffer/1 M NaCl/30 mM Spermidine/5 mM DTT. Cells 

25 were lysed by two passages through the French press (15,000 psi), followed by 

centrifugation at 13,000 rpm for 30 min. The supernatant was decanted and 
ammonium sulfate was added to a final concentration of 0.226 g/ml. The resulting 
pellet was collected by centrifugation and resuspended in 20 ml of buffer A. The 
resuspended pellet was dialyzed against buffer A containing no salt. The dialyzed 

30 protein (500 mg) was loaded onto a FFQ- Sepharose (35 ml) column and eluted with a 

linear gradient from 0 - 500 mM NaCl ( 10 c.v.). The peak fractions (21-45) were 
combined and dialyzed against buffer A (0 NaCl) for 3 hrs, then diluted to a 
conductivity of 50 mM NaCl and loaded (160 mg) onto a 120 ml Heparin- Affigel 
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column. Protein was eluted with a linear gradient of 0-500 mM NaCl (10 c.v.). The 
fractions containing the least contaminants (39-51) were precipitated with ammonium 
sulfate (0.226 g), collected by centnfugation, resuspended 5 ml of buffer A, and 
dialyzed in buffer A containing 200 mM NaCl. The delta subunit was stored at - 
80°C. The final delta subunit preparation is shown in the lane marked 5 of the 
Coomassie Blue stained SDS-polyacrylamide gel of Figure 14. Yield = 65 mg. 

Example 24 - Identification of S. pyogenes holB Encoding Delta Prime and 
Purification of the Delta Prime Subunit 

A search of the S. pyogenes genome database with the predicted B. 
subtilis delta prime amino acid sequence revealed aDNA sequence in contig #209 
(previously known as contig #210) that predicted a high scoring match for a gene 
encoding a delta prime protein. The following primers were used to obtain PCR 
products for holB: 

holB Upstream (SEQ. ID. No. 71) 

gcctaggata agggagggta catatggatt tagcgc 36 
holB Downstream (SEP. ID. No. 72) 

cgggcaagtc ttttgacaag cttcggatcc ccataacgaa ttcc 44 

The PCR product obtained from these primers was digested with Ndel and BamHI, 
purified and ligated into the pETl la vector to produce pETlla.S.p. holB. 

The pETl la.S.p.holB plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation 
was used to innoculate 12L LB broth supplemented with 200 ug/ml Ampicillin. Cells 
were grown at 37°C to O.D. 600=0.4, at which point the temperature was lowered to 
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, 
cells were collected by centrifugation and resuspended in 100 ml 50 mM Tris-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer. Lysis was initiated upon addition of 
0.4 mg/ml lysozyme followed by a 1 hr incubation on ice. Lysate was clarified by 
centrifugation at 13,000 rpm for 30 min. Ammonium sulfate was added to the 
supernatant to a final concentration of 0.3 g/ml. The protein pellet was resuspended in 
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buffer A(0.1 M NaCl) + 0.24 g/ml ammonium sulfate and clarified by centnfugation. 
The resulting protein pellet was resuspended in 20 ml of buffer A and dialyzed against 
buffer A. The dialyzed protein (450 mg) was loaded onto a 30 ml FFQ- Sepharose 
column and eluted with a linear gradient from 0 - 500 mM NaCl. The peak fractions 
were combined (fr# 20-30 containing 130 mg) and dialyzed against buffer A and 
loaded (70 mg) onto a 50 ml Heparin-Affigel column. Protein was eluted with a 
linear gradient of 0-500 mM NaCl. Delta prime binds weakly to both resins and elutes 
in the beginning of the gradient. This delta prime subunit was stored frozen at - 80°C. 
The final delta prime subunit preparation is shown in lane marked 5' of the Coomassie 
Blue stained SDS-polyacrylamide gel of Figure 14. Yield = 40 mg. 

Example 25 - Identification of the S. pyogenes dnaX Gene and Purification of the 
Tail Subunit 

A search of the S. pyogenes genome database with the putative B. 
subtilis tau amino acid sequence revealed a DNA sequence in contig #284 (previously 
known as contig # 289) with a high scoring match which predicted a gene encoding 
for a tau subunit protein. A set of PCR primers to 5'- and 3'- termini of the putative 
gene sequence were designed to include restriction enzyme recognition sequences for 
Ndel and BamHl sites, respectively. These primers are: 

dnaX Upstream (SEQ. ID. No. 73) 

ggagttaaaa acatatgtat caagctcttt ate 33 
dnaX Downstream (SEQ. ID. No. 74) 

cgtgggtaag ggcaaaaegg atcccttatg tatttcag 38 

A PCR product obtained with the above primers was digested with Ndel and BamHI, 
purified and ligated into pETl la vector to produce pETl la.S.p.dnaX. 

The pETl la.S.p.dnaX plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation 
was used to innoculate 24L LB broth supplemented with 200 ng/ml Ampicillin. Cells 
were grown at 37°C to O.D.600=0.5, at which point the temperature was lowered to 
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, 
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cells were collected by centrifugation and resuspended in 200 mis of 50 mM Tris-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer/IM NaCl/30 mM Spermidine/5 mM 
DTT/5 mM EDTA. Cells were lysed by two passages through the French press 
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. The supernatant 
(2.4 gm) was dialyzed against buffer A containing 50 mM NaCl, loaded onto a 120 ml 
FFQ column (without ammonium sulfate precipitation) and eluted with a linear 
gradient of 100-700 mM NaCl. The peak fractions (fr# 41-55) were combined, 
diluted with buffer A containing no salt (a dilution of 1/5) to a conductivity of 100 
mM NaCl, loaded (310 mg) onto a 300 ml Heparin-Affigel column, and eluted with a 
linear gradient of 100-500 mM NaCl. The peak fractions (fr# 21-36) were combined, 
dialyzed against buffer A, loaded (87 mg) onto 10 ml FFQ column, and eluted as 
described for the first FFQ column. The peak fractions (fr# 27-41) were concentrated 
by centrifugation in Centriprep 30 filtration unit and frozen at -80°C. The final tau 
subunit preparation is shown in the lane marked t of the Coomassie Blue stained SDS- 
polyacrylamide gel of Figure 14. Yield = 103 mg. 

Example 26 - Identification of the S. pyogenes dnaN Gene and Purification of the 
Beta Subunit 

A search of the S. pyogenes genome database with the putative B. 
subtilis beta subunit amino acid sequence revealed a DNA sequence (contig # 266) 
with a high scoring match which predicted a gene encoding for a beta subunit protein. 
A set of PCR primers to 5'- and 3'- termini of the putative gene sequence were 
designed to include restriction enzyme recognition sequences for Ndel and BamHI, 
respectively. The primers were: 

dnaN Upstream (SEQ. ID. No. 75) 

ggagttcata tgattcaatt ttcaaattaa tcgc 34 

dnaN Downstream (SEQ. ID. No. 76) 

tatcagctcc tggatccagt accttccatt gattagcc 38 

A PCR product obtained with these primers was digested with Ndel and BamHI, 
purified and ligated into pET16b vector to produce pET16b.S.p.dnaN. 
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The pET16b.S.p.dnaN plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation 
was used to innoculate 15L LB broth supplemented with 200 ug/ml Ampicillin. Cells 
were grown at 37°C to O.D.600=0.4, at which the point temperature was lowered to 
5 15°C and 0.5 mM IPTG was added. Induction proceeded for 1 6 hrs. In the morning, 

cells were collected by centrifugation and resuspended in 100 ml 50 mM Tris-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer/1 M NaCl/5 mM DTT7 30 mM 
Spermidine/5 mM EDTA. Cells were lysed by two passages through the French press 
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. Ammonium 

10 sulfate was added to the supernatant to a final concentration of 0.3 g/ml. The resulting 

protein pellet was resuspended and dialyzed against buffer A containing 50 mM NaCl. 
The dialyzed protein (300 mg) was loaded onto a 45 ml FFQ- Sepharose column and 
eluted with a linear gradient from 50 - 500 mM NaCl. The peak fractions (16-30) were 
combined, dialyzed against buffer A containing 50 mM NaCl, loaded onto a 25 ml 

15 EAH-Sepharose column, and eluted with a linear gradient of 50-500 mM NaCl. The 

fractions containing the least contaminants were combined into two pools (pool I 10- 
17, pool II 19-27). Each pool was further purified on a 8 ml MonoQ column 
(performed under conditions described for the FFQ column above). The final beta 
subunit preparation is shown in the lane marked p of the Coomassie Blue stained 

20 SDS-polyacrylamide gel of Figure 14. Yield = 48 mg. 

Example 27 - Identification of the S. pyogenes polC Gene and Purification of the 
Alpha-Large Polymerase Subunit 

25 A search of the B. subtilis genome database with the E. coli alpha 

subunit amino acid sequence revealed two DNA sequences with a high scoring match 
which predicted two genes encoding alpha-like polymerase subunits. The DNA 
sequence with the second highest scoring match which encoded the largest of the two 
polymerase subunits also appeared to encode for the epsilon exonuclease domain at 

30 the N- terminus of the putative alpha subunit. A search of the B. subtilis genome 

database with S. pyogenes DNA sequence confirmed this nucleotide sequence to 
encode the Gram positive homolog of the E. coli replicative polymerase subunit 
(alpha). This Gram negative alpha-like subunit lacked homology to epsilon. The 
gene encoding the large alpha polypeptide sequence (alpha-large) will be referred to as 
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the product of the polC gene and the gene encoding the smaller Gram-negative alpha- 
like polymerase (alpha-small) will be referred to as the product of the polE or dnaE 
gene (see Example 28). 

The alpha- large polymerase polypeptide is a product of two 
5 overlapping contigs; contig #197 (renamed #193) encodes the N-terminal 630 amino 

acids, and contig #278 (renamed #273) encodes the C-terminal 1392 amino acids. The 
putative Open Reading Frame generates a 1464 amino acid polypeptide (SEQ. ID. 
No. 18). Since the polC nucleotide sequence contained several Ndel sites, a primer 
was designed to mutate two restriction endonuclease sites in the pETl la nucleotide 

10 sequence upstream of the N-terminus of the gene; an Xbal restriction site was mutated 

to an Nhel restriction site and an Ndel restriction site at the starting ATG was 
removed. A 74mer primer which spans from mutated Xbal site upstream of T7 
promoter includes Nhel site, rbs site (ribosome binding site), mutated Ndel site and 
first 10 amino acid codons of poiC gene sequence. The following primers were used 

15 in a PCR reaction to amplify polC gene from S. pyogenes genomic DNA: 

polC Upstream (SEQ. ID. No. 77) 

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60 
tcagatttat tcgc 74 

20 

polC Downstream (SEQ. ID. No. 78) 

cggtgtctct atctaaatga ctcatttggg atcctcgctt tatacggtat gtcacag 57 

Elongase (BRL) produced the best amplification results. PCR reaction conditions 
25 were: 5 jig genomic DNA, 20 ng of each primer, 1 ml Elongase, 60 uM each dNTP, 

in 100 ml Elongase reaction buffer for 1 min at 94°C, 1 min at 55°C, and 6 min at 
60°C repeated for 40 cycles. The resulting 4000 bp PCR fragment was digested with 
Nhel and BamHI, purified and ligated into the pETl 1 a vector (digested with Xbal and 
BamHI) to produce pETl la.S.p.polC. 
30 The pETl 1 a.S.p.polC plasmid was transformed into the 

BL2 1 (DE3)RecA- strain of E. coli. A single colony from an overnight transformation 
was used to innoculate 24L LB broth supplemented with 200 ug/ml Ampicillin. Cells 
were grown at 37°C to OD600=0.4 at which point temperature was lowered to 15°C 
and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, cells 
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(12g) were collected by centrifugation and resuspended in 100 ml 50 mM Tns-HCl 
(pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer/1 M NaCl/5mM DTT/30 mM 
Spermidine/5 mM EDTA. Cells were lysed by two passages through the French press 
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. Ammonium 
5 sulfate was added to the supernatant to a final concentration of 0.226 g/ml. The 

precipitate was collected by centrifugation. The protein pellet (220 mg resuspended in 
buffer A) was dialyzed against buffer A containing 150 mM NaCl, loaded onto an 8 
ml FFQ column equilibrated with buffer A containing 150 mM NaCl, and eluted with 
a linear gradient of buffer A containing 150-600mM NaCl. The fractions containing 

10 the least contaminants (fr# 42-64) were combined and precipitated with ammonium 

sulfate (0.226 g/ml). The precipitate was collected by centrifugation and resuspended 
in buffer A (10 mg/ml in 5 ml). A fraction (1 ml=10mgs) of the concentrated protein 
was dialyzed, loaded onto 10 ml ssDNA-agarose column, and eluted with a linear 
gradient of 50-500 mM NaCl. The peak fractions (fr# 30-50) were combined and 

15 concentrated with ammonium sulfate (as above). The final alpha-large subunit 

preparation is shown in lane marked a L of the Coomassie Blue stained SDS- 
polyacrylamide gel of Figure 14. Yield= 4 mgs. 

Example 28 - Identification of the S. pyogenes dnaE Gene and Purification of the 
20 Alpha-Small Polymerase 

A search of the B. subtilis genome database using the E. coli alpha 
subunit amino acid sequence revealed two DNA sequences with a high scoring match 
which predicted two genes encoding for alpha-like polymerase subunits. The DNA 

25 sequence with the highest scoring match encodes a smaller alpha polymerase which 

does not contain an exonuclease domain. The putative short alpha DNA sequence is 
a product of the open reading frame in contig #253 of the S. pyogenes genome 
database. A set of PCR primers to 5'- and 3 '-termini of the putative gene sequence 
were designed to include restriction enzyme recognition sequences for Ndel and 

30 BamHI, respectively. The primers were: 



g -short Upstream (SEQ. ED. No. 79) 

gggaacaaga taaccaagga ggaacccatg gttgctcaac ttg 



43 
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g -short Downstream (SEQ. ID. No. 80) 

cgaatagcag cgttcatacc aggatcctcg ccgccactgg 40 

A PCR product obtained with these primers was digested with Ndel and BamHI, 
5 purified and ligated into pETl la vector to produce pETl la.S.p.dnaE. 

The pETl la.S.p.dnaE plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation 
was used to innoculate 12L LB broth supplimented with 200 ng/m\ Ampicillin. Cells 
were grown at 37°C to OD600=0.4, at which point temperature was lowered to 15°C 

10 and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, cells 

were collected by centrifugation and resuspended in 100 mis 50 mM Tris-HCl (pH 
7.5)/ 10% Sucrose /IX Heat Lysis Buffer/5 mM DTT/30 mM Spermidine/IM NaCl/5 
mM EDTA. Cells were lysed by two passages through the French press (1 5,000 psi), 
followed by centrifugation at 13,000 rpm for 30 min. Ammonium sulfate was added 

15 to the supernatant to a final concentration of 0.226 g/ml. The precipitate was 

collected by centrifugation. The protein pellet (resuspended in buffer A) was then 
dialyzed against buffer A. The dialyzed protein (600 mg) was loaded onto a 30 ml 
FFQ and eluted with a linear gradient of buffer A containing 50-500 mM NaCl. The 
peak fractions (200 mg in fr # 70-79) were dialyzed and loaded onto a 100 ml 

20 Heparin-Affigel column. The fractions containing the least contaminants (100 mg 

from fr # 18-30) were pooled and dialyzed against buffer A containing 300 mM NaCl. 
The dialysate (50 mg) was loaded onto a 50 ml ssDNA-agarose column and eluted 
with a linear gradient of 300mM - 1M NaCl. The final alpha-small subunit 
preparation is shown in lane marked a s of the Coomassie Blue stained SDS- 

25 polyacrylamide gel of Figure 14. Yield = 25 mg. 

Example 29 - Identification of the S. pyogenes ssb Gene and Purification of the 
Single Strand DNA-Binding Protein 

30 Search of the S. pyogenes genome using the B. subtilis SSB amino acid 

sequence identified a polypeptide in contig #230(212) as having highest homology to 
single strand binding protein of several Gram negative bacteria. This contig lacked the 
first 26 amino acids at the N-terminus. Circular PCR was employed to identify the 
DNA encoding the N-terminus of the putative SSB protein. 5". pyogenes genomic 
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DNA was digested overnight with Apol (5 fig chromosomal DNA in a 50 ul reaction). 
The DNA was extracted with phenol and precipitated with ethanol. The Apol 
digested chromosomal DNA was self-ligated to generate circular template for future 
use in the circular PCR. A circular PCR was performed with primers designed to 
5 anneal back-to-back to amplify circularized Apol reaction fragments. The primers 

were: 

ssb.circ Upstream (SEQ. ID. No. 81) 

accattttgg cttttaaagg tacggttaac agcaagtgtg aaggtagcc 49 

10 

ssb.circ Downstream (SEQ. ID. No. 82) 

gaacgcgagg cagatttcat taactgtgtg atctggcg 38 

The PCR reaction conditions were as follows: 100 ng circularized S. pyogenes 
15 genomic DNA, 20 ng each primer, 1 ml Elongase, 60 uM each dNTP, 100 1 

Elongase reaction buffer. Amplification was performed for 40 cycles as follows: 
denature, 1 min at 94°C; anneal, 1 min at 55°C; and extend, 5 min at 68°C. PCR 
products were cloned into the Topo TA vector following instructions of the 
manufacturer (Promega). Several positive clones were sequenced to obtain N- 
20 terminal nucleotide sequence. This information lead to design of the following 

primers with which the use of a standard PCR reaction generated whole ssb gene 
products. The primers were: 

ssb Upstream (SEQ. ID. No. 83) 

25 tttaaaagag ggtagcatat gattaataat gtagtactag ttggtcgc 48 

ssb Downstream (SEQ. ID. No. 84) 

tttaaattta aacctaggtt caatccattc tgactagaat ggaagatcgt c 51 

30 The resulting PCR product was digested with Ndel and BamHI, purified and ligated 

into pETl la vector to produce pETl la.S.p. ssb. 

The pETl la.S.p. ssb plasmid was transformed into the 
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation 
was used to innoculate 12L LB broth supplemented with 200 ug/ml Ampicillin. Cells 
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were grown at 37°C to OD600=0.5, at which point 0.5 mM IPTG was added. At the 
end of the 3 hr induction, cells were collected by centrifugation and resuspended in 
100 ml of 50 mM Tris-HC! (pH 7.5)/ 10% Sucrose /IX Heat Lysis Buffer/5 mM 
DTT/5 mM EDTA. The cell lysis was initiated upon addition of 0.4 mg/ml lysozyme 
followed by a 1 hr incubation on ice. The lysate was clarified by centrifugation at 
1 3,000 rpm for 30 min. The SSB protein was significantly purified by sequential 
fractionation with ammonium sulfate in the following manner. Solid ammonium 
sulfate was added to the clarified lysate to a final concentration of 0.24 g/ml and the 
precipitated protein was collected by centrifugation at 13,000 rpm for 30 min. The 
resulting pellet was homogenized in buffer A(0. 1 M NaCl) + 0.24 g/ml ammonium 
sulfate and the precipitate was collected by centrifugation. This procedure was 
repeated with buffer A(0.1 M NaCl) + 0.2 g/ml ammonium sulfate, buffer A(0.1 M 
NaCl + 0.15 g/ml ammonium sulfate, and buffer A(0.1 M NaCl) +0.13 g/ml 
ammonium sulfate. The final pellet was resuspended in buffer A + 0.15 M NaCl and 
dialyzed against the same buffer. The resulting pellet was resuspended in buffer A 
and dialyzed against buffer A containing 500 mM NaCl. The dialysate (300 mg) was 
diluted to 0. 15 M NaCl before it was loaded onto a 20 ml MonoQ column and eluted 
with a linear gradient of 0.15 M - 0.5 M NaCl in buffer A. The SSB protein elutes in 
the very beginning of the gradient. The peak fractions were combined (150 mg in 
fractions 16-30), diluted to 0.05 M NaCl, loaded onto a 10 ml ssDNA-agarose 
column, and eluted with 0.5 M NaCl. The peak fractions (32-62) were combined and 
frozen. The SSB was further purified over a MonoQ column to remove contaminating 
polymerase activity. The final single strand DNA binding protein preparation is shown 
in lane marked ssb of the Coomassie Blue stained SDS-polyacryl amide gel of Figure 
14. Yield = 120 mg. 

Example 30 - First Demonstration that S. pyogene holA Encodes a Delta Subunit 
Involved In Replication: Assembly of t56' Complex 

Gel filtration is a standard analytical technique to demonstrate direct 
protein-protein interaction. Purified t, 5, 5' proteins were used to examine whether 
they form a protein complex assembly. Gel filtration of t mixed with either 6, 8', or 
both 6 and 8' was performed using an HR 10/30 Superose 6 column equilibrated with 
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buffer A containing 100 mM NaCl. Either 5 (200 ug), 5' (200 ug), or a mixture 
of 5 and 8' (200 ug each) was incubated for 30 min at 15°C in 100 ul of buffer A 
containing 100 mM NaCl, and the entire mixture was injected onto the column. The 
mixture was resolved on the column by collection of 1 70 ul fractions after the initial 
5 void (6.6 ul) volume was collected. Fractions were analyzed by 10% SDS- 

polyacrylamide gels (30 ul/lane) stained with Coomassie Blue. 

The results, in Figure 15, demonstrate that under these conditions 
the x protein exhibits no (weak) interaction with the delta (Figure 15B) and the delta 
prime subunits (Figure 15C) individually, and yet assembles readily into a complex 

10 when all the subunits are mixed in the reaction (Figure 15 A). The t protein was 

mixed with a 2-fold molar excess of each S and 6', then gel filtered. A complex 
of t65' was formed as demonstrated by coellution of 8 and 8' with x (fr# 22-30) whereas 
excess 68' complex elutes in later fractions (fr#38-46). To determine whether 
individual 8 or 8' subunits interact with t, the x subunit was mixed with either 8 or 8' 

15 and then gel filtered. The results demonstrate that a gel filterable complex does not 

form when x is mixed with 5 (Figure 15B) or 8' (Figure 15C) subunits individually, as 
indicated by the absence of these subunits in the x containing fractions (fr#20-26). 
Therefore, it appears that the presence of both 8 and 8' subunits is essential for the 
formation of the x88' complex. 

20 

Example 31 - Second Demonstration that S. pyogenes holA Encodes Delta: 
Functional Assembly of P on DNA 

Gel filtration was used to demonstrate that the x, 8, 8' proteins form a 
25 functional clamp loading complex which is able to load the p clamp onto a circular 

DNA molecule. The reaction contained 0.5 pmol of gp2 nicked pBluescript plasmid 
(a circular double strand plasmid with a single nick produced by Ml 3 gp2 protein), 1 
pmol [ 32 P]P, 0.5 pmol x88' complex, 0.25 pmol of either 8, 8', x were used in 
individual experiments when a subassembly of the complex was tested (xS, x8\ 88') in 
30 75 ul buffer B (20 mM Tris-HCl (pH 7.5), 20 % glycerol, 0. 1 mM EDTA, 5 mM 

DTT, 2 mM ATP, 8 mM MgCl2). P was incubated with nicked DNA for 10 min at 
37°C either alone, or in combination with various assemblies of the x complex. All gel 
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filtration experiments were performed at 4°C. The reaction mixtures were applied to a 
5 ml column of Bio-Gel 15M (Bio-Rad) equilibrated in buffer B containing 100 mM 
NaCl. Fractions of 170 ul were collected and quantitated in the Scintillation counter. 

The results, in Figure 16, demonstrate that the assembly of the ring 
5 onto a circular DNA molecule requires the presence of x, 8, and 5' proteins 

(Figure 16A). In absence of any one of the subunits, loading onto DNA does not 
occur (Figure 16B-E). The clamp loader complex (t55') can be supplied as a mixture 
of t, 5, 8' subunits or as an assembled complex (purified from unassembled subunits by 
gel filtration, or by ion exchange chromatography on MonoQ). Proteins bound to the 
10 large DNA molecule elute in the early fractions (void fr# 10-17) and resolve from free 

proteins that elute in later fractions (fr# 18-35). 

Example 32 - The x Subunit Product of the dnaX Gene Binds a -large 

15 The interaction of S. pyogenes a and t proteins was examined by 

analyzing a mixture of the proteins by gel filtration. Gel filtration of t, a -large or a 
mixture of a-large and x was performed using an HR 10/30 Superose 6 column 
equilibrated with buffer A containing 100 mM NaCl. Either a-large (400 ug) (200 
uM) or a mixture of a-large and t was incubated for 30 min at 15°C in 100 pi of 

20 buffer A containing 1 00 mM NaCl, and the entire mixture was injected onto the 

column. The mixture was resolved on the column by collection of 170 ul fractions 
after the initial void (6.6 ml) volume was collected. Fractions were analyzed by 10% 
SDS-polyacrylamide gels (30 ul/lane) stained with Coomassie Blue. 

The results show a complex of a L x was formed as demonstrated by 

25 coellution of a-large and t (fr# 30-38) proteins (Figure 1 7A) compared to the elution 

profile of individual proteins (Figure 17B-C). Also, the migration of the x in the cilt 
complex changes significantly to a larger complex (4 fractions, from fr# 37 to fr# 33). 

Example 33 - Formation of aLx88' Complex 

30 

To determine whether a olxSS' complex could form, the following 
components were mixed: a -large (400 ug, 2.5 nmol), x (200 ug, 1.3 nrnol), 6 (200 ug, 
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4.8 nmol), 8' (200 ug, 5.75 pmol) in a final volume of 150 ul. The mixture was diluted 
to 300 ml with buffer A to lower conductivity of the sample to that equivalent of 100 
mM NaCl and incubated for 30 min at 15°C. The mixture was injected onto a 
Superose 6 column (equilibrated with buffer A containing 100 mM NaCl) and 
fractions (170 ul) were collected after an initial 6.6 ml of void volume was collected. 
Fractions were analyzed by 10% SDS-polyacrylamide gels (30 ul/lane) stained with 
Coomassie Blue. 

A gel filterable complex (Figure 18A) of cxltSS' was formed as 
demonstrated by coellution of t, 5 and 6' with a -large (fr# 14-26), whereas 
excess 68' complex elutes in later fractions (fr# 30-38). The migration of 
the tSS' protein complex in the cxlt88' complex does not change significantly. The 
complex might dissociate under the nonequilibrium conditions of gel filtration due to 
low concentration of proteins, salt concentration and speed of resolution. 

Next, ion exchange chromatography was used to analyze the protein 
mixture to prepare the reconstituted cxltSS' complex of S. pyogenes. The airSS' 
complex was reconstituted upon mixing a -large (10 mg, 62 nmol), t (6 mg, 72 nmol), 
6 (3.3 mg, 80 nmol), 8' (1.6 mg, 90 nmol). The a, i, S, 8' protein mixture was dialyzed 
for 2 hrs against buffer A containing 50 mM NaCl. The entire mixture was loaded 
onto a 1 ml MonoQ column equilibrated in buffer A containing 50 mM NaCl. Proteins 
were eluted with a 20 column volume linear gradient of 50-500 mM NaCl in buffer A 
and 0. 25 ml fractions were collected. Fractions were analyzed by 10% SDS- 
polyacrylamide gels (20 ul/lane) stained with Coomassie Blue. 

Generally, the reconstitution of the oclt8S' complex on a MonoQ 
column results in a tight salt resistant complex (Figure 18B, fr# 23-35) which elutes at 
500 mM NaCl. The high concentration of the proteins in the eluted fractions 
contributes to stability of the complex. 

Example 34 - The S. pyogenes Three Component Pol III-L Polymerase Is Rapid 
and Processive In DNA Synthesis 



It was previously demonstrated (i.e., in Examples 29 and 30) that the 
putative delta subunit plays an integral part in the assembly of the t88' complex 
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(Figure 15) and that this complex is sufficient to assemble p clamps onto circular 
primed DNA (Figure 16). It was also shown that the strong interaction between the a - 
large and t subunits (Figure 17) results in an isolatable a L T58' complex (Figure 18), 
similar to that of the E. coli DNA polymerase ID*. 

The MonoQ fractions containing ccltSS' complex were then used to 
assemble p onto primed DNA and determine whether this now resulted in rapid and 
processive DNA synthesis. Replication reactions contained 70 ng of singly primed 
M13mpl8 ssDNA and 0.82 ug of £ pyogenes SSB in 25 ul buffer C (20 mM Tris- 
HC1 (pH 7.5), 4 % glycerol, 0.1 mM EDTA, 5 mM DTT, 2 mM ATP, 8 mM MgCl2) 
with 60 \iM each of dGTP, dCTP, and dATP, 30 uM cold TTP and 20 uM [o-32P] 
TTP (specific activity of 2,000-4,000 cpm/pmol). The complex is assembled onto 
DNA in the following manner: 40 ng (3:1) or 140 ng (10:1) of the aixbtf complex and 
60 ng of p protein were preincubated for 2 min at 30°C in presence of SSB coated 
primed Ml 3 DNA and two nucleotides (dCTP and dGTP). Reactions were initiated by 
addition of the two remaining nucleotides dATP and TTP and quenched with an equal 
volume of 1% SDS/40 mM EDTA. Each time point is a separate reaction. 

A time course of replication on singly primed circular M13mpl8 
ssDNA is shown in Figure 19. The agarose gel analysis shows conversion of the 
oligonucleotide primed single stranded DNA to the slower migrating replicative form 
II. The fact that the speed of synthesis is independent of the concentration of 
polymerase in the reaction indicates that the cxltSS' complex synthesizes DNA in a 
rapid and a highly processive manner. The S. pyogenes ajjt8&' complex in presence of 
the p clamp, completely replicates (is able to complete replication of) 7250 nt of 
M13mpl8 ssDNA in 8-9 sec. 

Example 35 - The S. pyogenes DnaE (a-small) Forms a Three-Component 
Polymerase with x65' and p 

The S. pyogenes DnaE (a-small) polymerase is more homologous to E. coli a 
than S. pyogenes PolC. Thus, it seems reasonable to expect that the DnaE polymerase 
may also function with the p clamp (Figs. 21 A-B). To test DnaE for function with 
x85' and p, replication reactions contained 70 ng (25 frnol) of 30-mer singly primed 
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M13mpl 8 ssDNA, 0.82 ug of S. pyogenes SSB, and 3.3 ng - 300 ng ofDnaE (25 finol 
- 2.3 pmol) in 23.5 ul of 20 mM Tris-HCl (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 
mM dithiothreitol (DTT), 40 ug/ml BSA, 2 mM ATP, 8 mM MgCl 2 , and 60 uM each 
of dGTP and dCTP. When present, reactions included 43 .3 ng of p and 1 0 ng of t58'. 
Reactions were preincubated for 3 min at 37°C, and then NaCl was added to 40 mM 
followed by another 2 min at 37°C. DNA synthesis was initiated upon addition of 1.5 
ul of 1.5 mM dATP, 0.5 mM [a 32 P]-dTTP (specific activity 2,000-4,000 cpm/pmol). 
Aliquots of 25 ul were removed at the indicated times and quenched with an equal 
volume (25 ul) of 1% SDS, 40 mM EDTA. One-half of the quenched reaction was 
analyzed for total deoxynucleotide incorporation using DE81 filter paper and the other 
half was analyzed on a 0.8% neutral agarose gel. The effect of TMAU was also 
examined, in which lOOuM TMAU in DMSO (2% DMSO final concentration) was 
present. In this case, replication was allowed to proceed for 1 min before being 
quenched with 25 ul of 1% SDS, 40 mM EDTA. 

At a saturating concentration of DnaE polymerase, the time course of primer 
extension shows that it completes an M13mpl8 primed ssDNA template within 2 
minutes for a speed of at least 60 nucleotides/s (Fig. 21C). This rate of synthesis 
holds true for the highest amount of DnaE in the rightmost panel of the figure. As the 
DnaE concentration is decreased, a longer time is required to complete the circular 
template, indicating that the DnaE polymerase is not processive over the entire length 
of the M13mpl8 template. If the DnaE polymerase were fully processive during 
synthesis of the 7.2 kb ssDNA circle, the product profile over time would be 
qualitatively similar at all concentrations of enzyme, but the overall intensity of the 
profile would be diminished. This particular experiment was performed in the 
absence of p, but presence of t58'. When repeated in the presence of p but without 
t88', and in the absence of both p and x55', results similar to those shown in Fig. 21C 
were observed. 

In the presence of p and x88', DnaE polymerase is stimulated in synthesis at 
low concentration, indicating that p increases the processivity and/or speed of DnaE 
(Figs. 21C-D). At higher concentrations of DnaE, the presence of p/t65' has no effect 
on the rate of synthesis, and thus p does not increase the intrinsic speed of the enzyme 
(i.e., panels 3 and 4 of Fig. 2 ID). Hence, the effect of the p clamp on DnaE is 
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primarily due to an increase in processivity. The profile of product length over time 
remains essentially unchanged at the different DnaE concentrations, and therefore the 
processivity of DnaE, with p is at least equal to the 7.2 kb length of the M13mpl8 
substrate. 

5 The DnaE sequence does not show homology to an exonuclease, implying that 

it may have no associated nuclease activity. The DnaE preparation was examined for 
the presence of a 3'-5' exonuclease (Fig. 2 IE). The DnaE and PolC polymerases were 
each incubated with a 5' 32P-labeled oligonucleotide, followed by analysis in a 
sequencing gel. The result showed no degradation of the oligonucleotide by DnaE. 
10 PolC is a known 3'-5' exonuclease and it digests the end-labeled oligonucleotide as 

expected. 

Gram positive PolC is known to be inhibited by the antibiotic 
hydroxyphenylaza-uracil ("HPUra") and its derivatives, in Fig. 2 IF, the PolC t55', P 
and DnaE were tested for inhibition of synthesis on SSB coated primed M13mpl8 
15 ssDNA by an HPUra derivative, trimethylanilino-uracil ("TMAU"). The PoIC-tSS' p 

enzyme was prevented from forming the RFII product by TMAU. In contrast, the 
DnaE polymerase was not affected by TMAU in the presence of t55'/P (nor in the 
absence of toS'/P, not shown). 

20 Although the invention has been described in detail for the purpose of 

illustration, it is understood that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without departing from the spirit and 
scope of the invention which is defined by the following claims. 
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WHAT IS CLAIMED : 

1 . An isolated DNA molecule from a Gram positive bacterium, 
the isolated DNA molecule comprising a coding region from a polC gene, a dnaE 
gene, a holA gene, a /ao/fi gene, a dnaX gene, a dnaN gene, a 556 gene, a dnaG gene, or 
a cfriai? gene. 

2. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the polC gene. 

3. The isolated DNA molecule according to claim 2, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

4. An isolated DNA molecule according to claim 3, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 18. 

5. The isolated DNA molecule according to claim 4, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 17. 

6. The isolated DNA molecule according to claim 2, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 17 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

7. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaE gene. 

8. The isolated DNA molecule according to claim 7, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

9. The isolated DNA molecule according to claim 8, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 20. 
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10. The isolated DNA molecule according to claim 9, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 19. 

1 1 . The isolated DNA molecule according to claim 7, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 19 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

1 2. The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding region from the holA gene. 

13. The isolated DNA molecule according to claim 12, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

14. The isolated DNA molecule according to claim 13, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 22. 

15. The isolated DNA molecule according to claim 14, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 21. 

1 6. The isolated DNA molecule according to claim 1 2, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 21 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

1 7. The isolated DNA molecule according to claim 1 2, wherein the 
Gram positive bacterium is Staphylococcus aureus. 

1 8. The isolated DNA molecule according to claim 1 7, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ED. No. 12. 

1 9. The isolated DNA molecule according to claim 1 8, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 1 1. 
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20. The isolated DNA molecule according to claim 12, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ED. No. 1 1 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

2 1 . The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding regiong from the holB gene. 

22. The isolated DNA molecule according to claim 21, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

23. The isolated DNA molecule according to claim 22, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 24. 

24. The isolated DNA molecule according to claim 23, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ED. No. 23. 

25. The isolated DNA molecule according to claim 21, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ED. No. 23 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

26. The isolated DNA molecule according to claim 2 1 , wherein the 
Gram positive bacterium is Staphylococcus aureus. 

27. The isolated DNA molecule according to claim 26, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ED. No. 14. 

28. The isolated DNA molecule according to claim 27, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ED. No. 13. 
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29. The isolated DNA molecule according to claim 2 1 , wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ED. No. 13 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

30. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaX gene. 

3 1 . The isolated DNA molecule according to claim 30, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

32. The isolated DNA molecule according to claim 31, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 26. 

33. The isolated DNA molecule according to claim 32, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 25. 

34. The isolated DNA molecule according to claim 30, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ED. No. 25 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

35. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaN gene. 

36. The isolated DNA molecule according to claim 35, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

37. The isolated DNA molecule according to claim 36, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ED. No. 28. 

38. The isolated DNA molecule according to claim 37, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ED. No. 27. 
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39. The isolated DNA molecule according to claim 35, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 27 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

40. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the ssb gene. 

41. The isolated DNA molecule according to claim 40, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

42. The isolated DNA molecule according to claim 41, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 30. 

43. The isolated DNA molecule according to claim 42, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 29. 

44. The isolated DNA molecule according to claim 40, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 29 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

45. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaG gene. 

46. The isolated DNA molecule according to claim 45, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

47. The isolated DNA molecule according to claim 46, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 32. 
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48. The isolated DNA molecule according to claim 47, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 31. 

49. The isolated DNA molecule according to claim 45, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 31 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

50. The isolated DNA molecule according to claim 1 , wherein the 
DNA molecule comprises the coding region from the dnaB gene. 

5 1 . The isolated DNA molecule according to claim 50, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

52. The isolated DNA molecule according to claim 51, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 34. 

53. The isolated DNA molecule according to claim 52, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 33. 

54. The isolated DNA molecule according to claim 50, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 33 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

55. An expression system comprising an expression vector into 
which is inserted a heterologous DNA molecule according to claim 1 . 

56. The expression system according to claim 55, wherein the 
heterologous DNA molecule is in sense orientation and correct reading frame. 

57 A host cell comprising a heterologous DNA molecule 
according to claim 1 . 
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58. An isolated protein or polypeptide from a Gram positive 
bacterium, wherein the isolated protein or polypeptide is alpha-large, alpha-small, 
delta, delta prime, tau, beta, SSB, DnaG, or DnaB. 

59. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is alpha-large. 

60. The isolated protein or polypeptide according to claim 59, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

61. The isolated protein or polypeptide according to claim 60, 
wherein the alpha-large protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 18. 

62. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is alpha-small. 

63. The isolated protein or polypeptide according to claim 62, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

64. The isolated protein or polypeptide according to claim 63, 
wherein the alpha-small protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 20. 

65. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is delta. 



66. The isolated protein or polypeptide according to claim 65, 
30 wherein the Gram positive bacterium is Streptococcus pyogenes. 
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67. The isolated protein or polypeptide according to claim 66, 
wherein the delta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 22. 



wherein the Gram positive bacterium is Staphylococcus aureus. 

69. The isolated protein or polypeptide according to claim 68, 
wherein the delta protein or polypeptide comprises an amino acid sequence of SEQ. 

10 ED. No. 12. 

70. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is delta prime. 

15 71. The isolated protein or polypeptide according to claim 70, 

wherein the Gram positive bacterium is Streptococcus pyogenes. 

72. The isolated protein or polypeptide according to claim 71, 
wherein the delta prime protein or polypeptide comprises an amino acid sequence of 

20 SEQ. ED. No. 24. 

73 . The isolated protein or polypeptide according to claim 70, 
wherein the Gram positive bacterium is Staphylococcus aureus. 

25 74. The isolated protein or polypeptide according to claim 73, 



wherein the delta prime protein or polypeptide comprises an amino acid sequence of 
SEQ. ED. No. 14. 



5 



68. The isolated protein or polypeptide according to claim 65, 



30 



75. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is tau. 



76. The isolated protein or polypeptide according to claim 75, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 
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77. The isolated protein or polypeptide according to claim 76, 
wherein the tau protein or polypeptide comprises an amino acid sequence of SEQ. ED. 
No. 26. 

5 

78. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is beta. 

79. The isolated protein or polypeptide according to claim 78, 
10 wherein the Gram positive bacterium is Streptococcus pyogenes. 

80. The isolated protein or polypeptide according to claim 79, 
wherein the beta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 28. 

15 

8 1 . The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is SSB. 

82. The isolated protein or polypeptide according to claim 81, 
20 wherein the Gram positive bacterium is Streptococcus pyogenes. 

83. The isolated protein or polypeptide according to claim 82, 
wherein SSB comprises an amino acid sequence of SEQ. ID. No. 30. 

25 84. The isolated protein or polypeptide according to claim 58, 

wherein the isolated protein or polypeptide is DnaG. 

85. The isolated protein or polypeptide according to claim 84, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

30 

86. The isolated protein or polypeptide according to claim 85, 
wherein the DnaG protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 32. 
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87. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is DnaB. 

5 88. The isolated protein or polypeptide according to claim 87, 

wherein the Gram positive bacterium is Streptococcus pyogenes. 

89. The isolated protein or polypeptide according to claim 88, 
wherein the DnaB protein or polypeptide comprises an amino acid sequence of SEQ. 
10 ID. No. 34. 



90. A method of identifying compounds which inhibit the activity 
of a polymerase product of polC or dnaE comprising: 

forming a reaction mixture comprising a primed DNA molecule, a 
15 polymerase product of polC or dnaE, a candidate compound, a dNTP, and optionally 

either a beta subunit, a tau complex, or both the beta subunit and the tau complex, 
wherein at least one of the polymerase product of polC or dnaE, the beta subunit, the 
tau complex, or a subunit or combination of subunits thereof is derived from a 
Eubacteria other than Escherichia coli; 
20 subjecting the reaction mixture to conditions effective to achieve 

nucleic acid polymerization in the absence of the candidate compound; 

analyzing the reaction mixture for the presence or absence of nucleic 
acid polymerization extension products; and 

identifying the candidate compound in the reaction mixture where there 
25 is an absence of nucleic acid polymerization extension products. 



91 . The method according to claim 90, wherein the polymerase 
product of polC or dnaE is from a Streptococcus bacterium or a Staphylococcus 
bacterium. 
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SEQUENCE LISTING 
<110> The Rockefeller University 

<12 0> DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND 
THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS 

<130> 22221/1022 

<140> 
<141> 

<150> 60/146,178 
<151> 1999-07-29 

<160> 84 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 3195 
<212> DNA 

<213> Staphylococcus aureus 



<400> 1 



atggtggcat 


atttaaatat 


tcatacggct 


tatgatttgt 


taaattcaag 


cttaaaaata 


60 


gaagatgccg 


taagacttgc 


tgtgtctgaa 


aatgttgatg 


cacttgccat 


aactgacacc 


120 


aatgtattgt 


atggttttcc 


taaattttat 


gatgcatgta 


tagcaaataa 


cattaaaccg 


180 


atttttggta 




tgtgacaaat 


ggattaaata 


cagtcgaaac 


agttgttcta 


240 


gctaaaaata 


atgatggatt 


aaaagatttg 


tatcaactat 


catcggaaat 


aaaaatgaat 


300 


gcattagaac 


atgtgtcgtt 


tgaattatta 


aaacgatttt 


ctaacaatat 


gattatcatt 


360 


tttaaaaaag 


tcggtgatca 


acatcgtgat 


attgtacaag 


tgtttgaaac 


ccataatgac 


420 


acatatatgg 


accaccttag 


tatttcgatt 


caaggtagaa 


aacatgtttg 


gattcaaaat 


480 


gtttgttacc 


aaacacgtca 


agatgccgat 


acgatttctg 


cattagcagc 


tattagagac 


540 


aatacaaaat 


tagac ttaat 


tcatgatcaa 


gaagattttg 


gtgcacattt 


tttaactgaa 


600 


aaggaaatta 


atcaattaga 




gaatatttaa 


cgcaggttga 


tgttatagct 


660 


caaaagtgtg 


atgcagaatt 


aaaatatcat 


caatctctac 


ttcctcaata 


tgagacacct 


720 


aatgatgaat 


cagctaaaaa 


atatttgtgg 


cgtgtcttag 


ttacacaatt 


gaaaaaatta 


780 


gaacttaatt 


atgacgtcta 


tttagagcga 


ttgaaatatg 


agtataaagt 


tattactaat 


840 


atgggttttg 


aagattattt 


cttaatagta 


agtgatttaa 


tccattatgc 


gaaaacgaat 


900 


gatgtgatgg 


taggtcctgg 


tcgtggttct 


tcagctggct 


cactggtcag 


ttatttattg 


960 


ggaattacaa 


cgattgatcc 


tattaaattc 


aatctattat 


ttgaacgttt 


tttaaaccca 


1020 


gaacgtgtaa 


caatgcctga 


tattgatatt 


gactttgaag 


atacacgccg 


agaaagggtc 


1080 


attcagtacg 


tccaagaaaa 


atatggcgag 


ctacatgtat 


ctggaattgt 


gactttcggt 


1140 


catctgcttg 


caagagcagt 


tgctagagat 


gttggaagaa 


ttatggggtt 


tgatgaagtt 


1200 


acattaaatg 


aaatttcaag 


tttaatccca 


cataaattag 


gaattacact 


tgatgaagca 


1260 


tatcaaattg 


acgattttaa 


agagtttgta 


catcgaaacc 


atcgacatga 


acgctggttc 


1320 


agtatttgta 


aaaagttaga 


aggtttacca 


agacatacat 


ctacacatgc 


ggcaggaatt 


1380 
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attattaatg accatccatt atatgaatat 
ttaacgcaat ggacaatgac tgaagccgaa 
gggttgagaa acttatcgat tattcatcaa 
attaatattg atatcgaaaa gattccgttt 
caaggagata cgactggcat attccaatta 
aaattaaagc cggaacactt tgaagatatt 
ccaatggaag aaattccaac ttacattaca 
ttacatccgc atttagaacc tatattaaaa 
caaattatgc aaatagcgag cacatttgca 
agaagagcaa tgagtaaaaa aaatagagct 
gaaggtgcaa agcaaaatgg ttatcacgaa 
ctgaaatttg ctgattatgg ttttcctaga 
tacattatga gctttttaaa agtccattat 
aatgttattg gaagtgagaa gaaaactgct 
atcactatat tgccaccgaa cattaacgaa 
ggcatttatt tatcaattgg tacaattaaa 
gttgatgaac gttatcagaa cggcaaattt 
ccgaagagag tcaaaacgag aaagttactt 
gcttttggta aaacacgttc aacgttgttg 
ttaaacattg aacaagatgg ttttttattt 
gataaagaag aattgcctga tgcacttatt 
tatgtttcgc aacacccagt agataaaaag 
aaattgagta acgcgcagaa ttataaacct 
attcgaacta aaaatggtca aaatatggca 
ttagatggtg tgattttccc taatcagttt 
gacttgttta tagttagcgg gaaatttgac 
aatgagattc agacattagc cacttttgaa 
ataattagaa ataaatcaca aatagatatg 
aatgctaatg atgttgtgtt atccttttat 
ggctatatta atcaaaaaga tagtatgttt 
gatattaggc ttata 



gcccctttaa cgaaagggga tacaggatta 1440 
cgtattgggt tattaaaaat agattttcta 1500 
atcttaacac aagtcaaaaa agatttaggt 1560 
gatgatcaaa aagtgtttga attgttgtcg 1620 
gagtctgacg gtgtaagaag tgtattaaaa 1680 
gttgctgtaa cttctttgta tagaccaggt 1740 
agaagacatg atccaagcaa agttcaatat 1800 
aatacttacg gtgttattat ttatcaagag 1860 
aacttcagtt atggtgaagc ggatatttta 1920 
gttcttgaaa gtgagcgtca acattttata 1980 
gacattagta agcaaatatt tgatttgatt 2040 
gcacatgctg tcagctattc taaaattgca 2100 
ccaaattatt tttacgcaaa tattttaagt 2160 
caaatgatag aagaagcaaa aaaacaaggt 2220 
agtcattggt tttataaacc ttcccaagaa 2280 
ggtgttggtt atcaaagtgt gaaagtgatt 2340 
aaagatttct ttgattttgc tagacgtata 2400 
gaagcactga ttttagtggg agcgtttgat 2460 
caagctattg atcaagtgtt ggatggcgat 2520 
gatattttaa cgccaaaaca gatgtatgaa 2580 
agtcagtacg aaaaagaata tttaggattt 2640 
tttgttgcca aacaatattt aacgatattt 2700 
atattagtac agtttgataa agttaaacaa 2760 
ttcgtcacat taaatgatgg cattgaaact 2820 
aaaaagtacg aagagttgtt atcacataat 2880 
catagaaagc aacaacgtca actaattata 2940 
gaacaaaaat tagcatttgc caaacaaatt 3000 
tttgaagaga tgattaaagc tacgaaagag 3060 
gatgaaacga ttaaacaaat gactacttta 3120 
aataatttta tacaatcctt taaccctagt 3180 
3195 



<210> 2 
<211> 1065 
<212> PRT 

<213> Staphylococcus aureus 
<400> 2 

Met Val Ala Tyr Leu Asn lie His Thr Ala Tyr Asp Leu Leu Asn Ser 
15 10 15 

Ser Leu Lys He Glu Asp Ala Val Arg Leu Ala Val Ser Glu Asn Val 
20 25 30 

Asp Ala Leu Ala He Thr Asp Thr Asn Val Leu Tyr Gly Phe Pro Lys 
35 40 45 
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Phe Tyr Asp Ala Cya lie Ala Asn Asn lie Lys Pro lie Phe Gly Met 



Thr lie Tyr Val Thr Asn Gly Leu Asn Thr Val Glu Thr Val Val Leu 
65 70 75 80 

Ala Lys Asn Asn Asp Gly Leu Lys Asp Leu Tyr Gin Leu Ser Ser Glu 
85 90 95 

lie Lys Met Asn Ala Leu Glu His Val Ser Phe Glu Leu Leu Lys Arg 
100 105 110 

Phe Ser Asn Asn Met He lie He Phe Lys Lys Val Gly Asp Gin His 
115 120 125 

Arg Asp He Val Gin Val Phe Glu Thr His Asn Asp Thr Tyr Met Asp 
130 135 140 

His Leu Ser He Ser He Gin Gly Arg Lys His Val Trp He Gin Asn 
145 150 155 160 

Val Cys Tyr Gin Thr Arg Gin Asp Ala Asp Thr He Ser Ala Leu Ala 
165 170 175 

Ala He Arg Acp Asn Thr Lys Leu Asp Leu He His Asp Gin Glu Asp 
180 185 190 

Phe Gly Ala His Phe Leu Thr Glu Lys Glu He Asn Gin Leu Asp He 
195 200 205 

Asn Gin Glu Tyr Leu Thr Gin Val Asp Val He Ala Gin Lys Cys Asp 
210 215 220 

Ala Glu Leu Lys Tyr His Gin Ser Leu Leu Pro Gin Tyr Glu Thr Pro 
225 230 235 240 

Asn Asp Glu Ser Ala Lys Lye Tyr Leu Trp Arg Val Leu Val Thr Gin 
245 250 255 

Leu Lys Lys Leu Glu Leu Asn Tyr Asp Val Tyr Leu Glu Arg Leu Lys 
260 265 270 

Tyr Glu Tyr Lys Val He Thr Asn Met Gly Phe Glu Asp Tyr Phe Leu 
275 280 285 

He Val Ser Asp Leu He His Tyr Ala Lys Thr Asn Asp Val Met Val 
290 295 300 
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Gly Pro Gly Arg Gly Ser Ser Ala Gly Ser Leu Val Ser Tyr Leu Leu 
305 310 315 320 

Gly He Thr Thr He Asp Pro He Lys Phe Asn Leu Leu Phe Glu Arg 

325 330 335 

Phe Leu Asn Pro Glu Arg Val Thr Met Pro Asp lie Asp He Asp Phe 
340 345 350 

Glu Asp Thr Arg Arg Glu Arg Val He Gin Tyr Val Gin Glu Lye Tyr 
355 360 365 

Gly Glu Leu His Val Ser Gly He Val Thr Phe Gly His Leu Leu Ala 
370 375 380 

Arg Ala Val Ala Arg Asp Val Gly Arg He Met Gly Phe Asp Glu Val 
385 390 395 400 

Thr Leu Asn Glu He Ser Ser Leu He Pro His Lys Leu Gly He Thr 
405 410 415 

Leu Asp Glu Ala Tyr Gin He Asp Asp Phe Lys Glu Phe Val His Arg 
420 425 430 

Asn His Arg His Glu Arg Trp Phe Ser He Cys Lys Lya Leu Glu Gly 
435 440 445 

Leu Pro Arg His Thr Ser Thr His Ala Ala Gly He He He Asn Asp 
450 455 460 

His Pro Leu Tyr Glu Tyr Ala Pro Leu Thr Lys Gly Asp Thr Gly Leu 
465 470 475 480 

Leu Thr Gin Trp Thr Met Thr Glu Ala Glu Arg He Gly Leu Leu Lys 
485 490 495 

He Asp Phe Leu Gly Leu Arg Asn Leu Ser He He His Gin He Leu 
500 505 510 

Thr Gin Val Lys Lys Asp Leu Gly He Asn He Asp He Glu Lys He 
515 520 525 

Pro Phe Asp Asp Gin Lys Val Phe Glu Leu Leu Ser Gin Gly Asp Thr 
530 535 540 

Thr Gly He Phe Gin Leu Glu Ser Asp Gly Val Arg Ser Val Leu Lys 
545 550 555 560 
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Lys Leu Lys Pro Glu His Phe Glu Asp lie Val Ala Val Thr Ser Leu 
565 570 575 

Tyr Arg Pro Gly Pro Met Glu Glu lie Pro Thr Tyr lie Thr Arg Arg 
580 585 590 

His Asp Pro Ser Lys Val Gin Tyr Leu His Pro His Leu Glu Pro lie 
595 600 605 

Leu Lys Asn Thr Tyr Gly Val lie lie Tyr Gin Glu Gin lie Met Gin 
610 615 620 

He Ala Ser Thr Phe Ala Asn Phe Ser Tyr Gly Glu Ala Asp He Leu 
625 630 635 640 

Arg Arg Ala Met Ser Lys Lys Asn Arg Ala Val Leu Glu Ser Glu Arg 
645 650 655 

Gin His Phe He Glu Gly Ala Lys Gin Asn Gly Tyr His Glu Asp He 
660 665 670 

Ser Lys Gin He Phe Asp Leu He Leu Lys Phe Ala Asp Tyr Gly Phe 
675 680 685 

Pro Arg Ala His Ala Val Ser Tyr Ser Lys He Ala Tyr He Met Ser 
690 695 700 

Phe Leu Lys Val His Tyr Pro Asn Tyr Phe Tyr Ala Asn He Leu Ser 
705 710 715 720 

Asn val He Gly Ser Glu Lys Lys Thr Ala Gin Met He Glu Glu Ala 
725 730 735 

Lys Lys Gin Gly He Thr He Leu Pro Pro Asn He Asn Glu Ser His 
740 745 750 

Trp Phe Tyr Lys Pro Ser Gin Glu Gly He Tyr Leu Ser He Gly Thr 
755 760 765 

He Lys Gly Val Gly Tyr Gin Ser Val Lye Val He Val Asp Glu Arg 
770 775 780 

Tyr Gin Asn Gly Lys Phe Lys Asp Phe Phe Asp Phe Ala Arg Arg He 
785 790 795 800 

Pro Lys Arg Val Lys Thr Arg Lys Leu Leu Glu Ala Leu He Leu Val 
805 810 815 
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Gly Ala Phe Asp Ala Phe Gly Lys Thr Arg Ser Thr Leu Leu Gin Ala 
820 825 830 

lie Asp Gin Val Leu Asp Gly Asp Leu Asn lie Glu Gin Asp Gly Phe 
835 840 845 

Leu Phe Asp lie Leu Thr Pro Lys Gin Met Tyr Glu Asp Lys Glu Glu 
850 855 860 

Leu Pro Asp Ala Leu He Ser Gin Tyr Glu Lys Glu Tyr Leu Gly Phe 
865 870 875 880 

Tyr Val Ser Gin His Pro Val Asp Lys Lys Phe Val Ala Lys Gin Tyr 
885 890 895 

Leu Thr He Phe Lys Leu Ser Asn Ala Gin Asn Tyr Lys Pro He Leu 
900 905 910 

Val Gin Phe Asp Lys Val Lys Gin He Arg Thr Lys Asn Gly Gin Asn 
915 920 925 

Met Ala Phe Val Thr Leu Asn Asp Gly He Glu Thr Leu Asp Gly Val 
930 935 940 

He Phe Pro Asn Gin Phe Lys Lys Tyr Glu Glu Leu Leu Ser His Asn 
945 950 955 960 

Asp Leu Phe He Val Ser Gly Lys Phe Asp His Arg Lys Gin Gin Arg 
965 970 975 

Gin Leu He He Asn Glu He Gin Thr Leu Ala Thr Phe Glu Glu Gin 
980 985 990 

Lys Leu Ala Phe Ala Lys Gin He He He Arg A6n Lys Ser Gin He 
995 1000 1005 

Asp Met Phe Glu Glu Met He Lys Ala Thr Lys Glu Asn Ala Asn Asp 
1010 1015 1020 

Val Val Leu Ser Phe Tyr Asp Glu Thr He Lys Gin Met Thr Thr Leu 
1025 1030 1035 1040 

Gly Tyr He Asn Gin Lys Asp Ser Met Phe Asn Asn Phe He Gin Ser 
1045 1050 1055 

Phe Asn Pro Ser Asp He Arg Leu He 
1060 1065 
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<210> 3 
<211> 1698 
<212> DNA 

<213> Staphylococcus aureus 
<400> 3 

ttgaattatc aagccttata tcgtatgtac 
caagaacatg tcacgaagac attgcgcaat 
tatattttta gtggtccgag aggtacgggg 
gcaatcaact gtttaaatag cactgatgga 
ggcattacgc aggggactaa ttcagatgtg 
gttgatgaaa taagaaatat tagagacaaa 
aaagtttata ttatagatga ggtgcacatg 
aagacgttag aagaacctcc agcacacgct 
aaaatccctc caacaatcat ttctagggca 
gatcaaattg ttgaacgttt aaaatttgta 
gaagccttgg catttatcgc taaagcgtct 
atggatcagg ctattgcttt cggcgatggc 
acgggtagcg ttcatgatga agcgttggat 
gtacaagcat cttttaaaaa ataccatcag 
ctaataaatg atatgattta ttttgtcaga 
gatactgagt atcgagcact gatgaactta 
cttattaatg atacattagt gtcgattcgt 
gtattgttag taaaattagc tgagcagatt 
gctgaaccag cacaaattgc ttcatcgcca 
cagttagagc aagaactaaa aacactaaaa 
aaatcttcga aaaagcctgc gagaggtata 
caaattgcaa aagtgctaga taaagcgaat 
tggcaagaag tgattgacca tgcccaaaac 
caaaattcgg aacctgtggc ggcaagtgaa 
atccattgtg aaatcgtcaa taaagacgac 
tgtaatatcg ttaataaaaa cgttaaagtt 
gttcgaacgg agtatttaca aaatcgtaaa 
gcacaacaaa cagatattgc tcaaaaagca 
gtgatagatg aagagtga 



agaccccaaa gtttcgagga tgtcgtcgga 60 
gcgatttcga aagaaaaaca gtcgcatgca 120 
aaaacgagta ttgccaaagt gtttgctaaa 180 
gaaccttgta atgaatgtca tatttgtaaa 240 
atagaaattg atgctgctag taataatggc 3 00 
gttaaatatg caccaagtga atcgaaatat 3 60 
ctaacaacag gtgcttttaa tgccctttta 420 
atttttatat tggcaacgac agaaccacat 480 
caacgttttg attttaaagc aattagccta 540 
gcagatgcac aacaaattga atgtgaagat 600 
gaagggggta tgcgtgatgc attaagtatt 660 
acattgacat tacaagatgc cctaaatgtt 720 
cacttgtttg atgatattgt acaaggtgac 7 80 
tttataacag aaggtaaaga agtgaatcgc 840 
gatacgatta tgaataaaac atctgagaaa 900 
gaattagata tgttatatca aatgattgat 960 
tttagtgtga atcaaaacgt tcattttgaa 1020 
aagggtcaac cacaagtgat tgcgaatgta 1080 
aacacagatg tattgttgca acgtatggaa 1140 
gcacaaggag tgagtgttgc tcctactcaa 1200 
caaaaatcta aaaatgcatt ttcaatgcaa 1260 
aaggcagata tcaaattgtt gaaagatcat 1320 
aatgataaaa aatcactcgt tagtttattg 1380 
gatcacgtcc ttgtgaaatt tgaggaagag 1440 
gagaaacgta gtagtataga aagtgttgta 1500 
gttggtgtac catcagatca atggcaaaga 1560 
aacgaaggcg atgatatgcc aaagcaacaa 1620 
aaagatcttt tcggtgaaga aactgtacat 1680 
1698 



<210> 4 
<211> 566 
<212> PRT 

<213> Staphylococcus aureus 
<400> 4 

Leu Asn Tyr Gin Ala Leu Tyr Arg Met Tyr Arg Pro Gin Ser Phe Glu 
15 10 15 

Asp Val Val Gly Gin Glu His Val Thr Lys Thr Leu Arg Asn Ala lie 
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20 25 30 

Ser Lys Glu Lys Gin Ser His Ala Tyr lie Phe Ser Gly Pro Arg Gly 
35 40 45 

Thr Gly Lys Thr Ser lie Ala Lys Val Phe Ala Lys Ala lie Asn Cys 
50 55 60 

Leu Asn Ser Thr Asp Gly Glu Pro Cys Asn Glu Cys His lie Cys Lys 



Gly lie Thr Gin Gly Thr Asn Ser Asp Val He Glu He Asp Ala Ala 
85 90 95 

Ser Asn Asn Gly Val Asp Glu He Arg Asn He Arg Asp Lys Val Lys 
100 105 110 

Tyr Ala Pro Ser Glu Ser Lys Tyr Lys Val Tyr He He Asp Glu Val 
115 120 125 

His Met Leu Thr Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu 
130 135 140 

Glu Pro Pro Ala His Ala He Phe He Leu Ala Thr Thr Glu Pro His 
145 150 155 160 

Lys He Pro Pro Thr He He Ser Arg Ala Gin Arg Phe Asp Phe Lys 
165 170 175 

Ala He Ser Leu Asp Gin He Val Glu Arg Leu Lys Phe Val Ala Asp 
180 185 190 

Ala Gin Gin He Glu Cys Glu Asp Glu Ala Leu Ala Phe He Ala Lys 
195 200 205 

Ala Ser Glu Gly Gly Met Arg Asp Ala Leu Ser He Met Asp Gin Ala 
210 215 220 

He Ala Phe Gly Asp Gly Thr Leu Thr Leu Gin Asp Ala Leu Asn Val 
225 230 235 240 

Thr Gly Ser Val His Asp Glu Ala Leu Asp His Leu Phe Asp Asp He 
245 250 255 

Val Gin Gly Asp Val Gin Ala Ser Phe Lys Lys Tyr His Gin Phe He 
260 265 270 

Thr Glu Gly Lys Glu Val Asn Arg Leu He Aan Asp Met He Tyr Phe 
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275 280 285 

Val Arg Asp Thr He Met Asn Lys Thr Ser Glu Lys Abp Thr Glu Tyr 
290 295 300 

Arg Ala Leu Met Asn Leu Glu Leu Asp Met Leu Tyr Gin Met He Asp 
305 310 315 320 

Leu He Asn Asp Thr Leu Val Ser He Arg Phe Ser Val Asn Gin Asn 
325 330 335 

Val His Phe Glu Val Leu Leu Val Lys Leu Ala Glu Gin He Lys Gly 
340 345 350 

Gin Pro Gin Val He Ala Asn Val Ala Glu Pro Ala Gin He Ala Ser 
355 360 365 

Ser Pro Asn Thr Asp Val Leu Leu Gin Arg Met Glu Gin Leu Glu Gin 
370 375 380 

Glu Leu Lys Thr Leu Lys Ala Gin Gly Val Ser Val Ala Pro Thr Gin 
385 390 395 400 

Lys Ser Ser Lys Lys Pro Ala Arg Gly He Gin Lys Ser Lys Asn Ala 
405 410 415 

Phe Ser Met Gin Gin He Ala Lys Val Leu Asp Lys Ala Asn Lys Ala 
420 425 430 

Asp He Lys Leu Leu Lys Asp His Trp Gin Glu Val He Asp His Ala 
435 440 445 

Gin Asn Asn Asp Lys Lys Ser Leu Val Ser Leu Leu Gin Asn Ser Glu 
450 455 460 

Pro Val Ala Ala Ser Glu Asp His Val Leu Val Lys Phe Glu Glu Glu 
465 470 475 480 

He His CyB Glu He Val Asn Lys Asp Asp Glu Lys Arg Ser Ser He 
485 490 495 

Glu Ser Val Val Cys Asn He Val Asn Lys Asn Val Lys Val Val Gly 
500 505 510 

Val Pro Ser Asp Gin Trp Gin Arg Val Arg Thr Glu Tyr Leu Gin Asn 
515 520 525 

Arg Lys Asn Glu Gly Asp Asp Met Pro Lys Gin Gin Ala Gin Gin Thr 
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530 535 540 

Asp lie Ala Gin Lys Ala Lye Asp Leu Phe Gly Glu Glu Thr Val His 
54 5 550 555 560 

Val lie Asp Glu Glu Glx 
565 



<210> 5 
<211> 1398 
<212> DNA 

<213> Staphylococcus aureus 
<400> 5 

atggatagaa tgtatgagca aaatcaaatg 
ttaggttcaa ttattataga tccagaattg 
gagtcgtttt ataggggtgc ccatcaacat 
gataataaag aaattgatgt tgtaacattg 
aatgaagcgg gtggcccgca atatcttgca 
aatgttcagt attatactga tatcgtttct 
actgcagata gtattgccaa tgatggatat 
agtgatgcag aacgtcgaat tttagagcta 
gacattcgag acgtcttagg acaagtgtat 
ggtcaaacac caggtatacc tacaggatat 
aaccgaaatg atttaattat ccttgcagcg 
cttaatattg cacaaaaagt tgcaacgcat 
ctagagatgg gtgctgatca gttagccaca 
tcaaaccgct taagaacggg tactatgact 
gtaggtaaat tatcacgtac gaagattttt 
gatttacgtt ctaaatgtcg tcgattaaag 
gactacttac agttgattca aggtagtggt 
gtttctgaaa tctctcgtac attaaaagca 
gcattaagtc agttatctcg tggtgttgaa 
gatattcgtg aatctggttc gattgagcaa 
gatgattact ataaccgtgg cggcgatgaa 
caaacgaatg atgaaaacgg tgaaattgaa 
acaggcacag ttaagttaca ttttatgaaa 
gcacatgcag atatgatg 



ccgcataaca atgaagctga acagtctgtc 60 
attaatacta ctcaggaagt tttgcttcct 120 
attttccgtg caatgatgca cttaaatgaa 180 
atggatcaat tatcgacgga aggtacgttg 240 
gagttatcta caaatgtacc aacgacgcga 3 00 
aagcatgcat taaaacgtag attgattcaa 3 60 
aatgatgaac ttgaactaga tgcgatttta 420 
tcatcttctc gtgaaagcga tggctttaaa 480 
gaaacagctg aagagcttga tcaaaatagt 540 
cgagatttag accaaatgac agcagggttc 600 
cgtccatctg taggtaagac tgcgttcgca 660 
gaagatatgt atacagttgg tattttctcg 720 
cgtatgattt gtagttctgg aaatgttgac 780 
gaggaagatt ggagtcgttt tactatagcg 840 
attgatgata caccgggtat tcgaattaat 900 
caagaacatg gcttagacat gattgtgatt 960 
tcacgtgcgt ccgataacag acaacaggaa 1020 
ttagcccgtg aattaaaatg tccagttatc 1080 
caacgacaag ataaacgtcc aatgatgagt 114 0 
gatgccgata tcgttgcatt cttataccgt 1200 
gatgatgacg atgatggtgg tttcgagcca 12 60 
attatcattg ctaagcaacg taacggtcca 1320 
caatataata aatttaccga tatcgattat 13 80 
1398 



<210> 6 
<211> 466 
<212> PRT 

<213> Staphylococcus aureus 
<400> 5 

Met Asp Arg Met Tyr Glu Gin Asn Gin Met Pro His Asn Asn Glu Ala 
10 



WO 01/09164 



PCT/L'SOO/20666 



15 10 15 

Glu Gin Ser Val Leu Gly Ser lie lie lie Asp Pro Glu Leu lie Asn 
20 25 30 

Thr Thr Gin Glu Val Leu Leu Pro Glu Ser Phe Tyr Arg Gly Ala Hi6 
35 40 45 

Gin His lie Phe Arg Ala Met Met His Leu Asn Glu Asp Asn Lys Glu 
50 55 60 

lie Asp Val Val Thr Leu Met Asp Gin Leu Ser Thr Glu Gly Thr Leu 
65 70 75 80 

Asn Glu Ala Gly Gly Pro Gin Tyr Leu Ala Glu Leu Ser Thr Asn Val 
85 90 95 

Pro Thr Thr Arg Asn Val Gin Tyr Tyr Thr Asp lie Val Ser Lys His 
100 105 110 

Ala Leu Lys Arg Arg Leu lie Gin Thr Ala Asp Ser lie Ala Asn Asp 
115 120 125 

Gly Tyr Asn Asp Glu Leu Glu Leu Asp Ala lie Leu Ser Asp Ala Glu 
130 135 140 

Arg Arg lie Leu Glu Leu Ser Ser Ser Arg Glu Ser Asp Gly Phe Lys 
145 150 155 160 

Asp He Arg Asp Val Leu Gly Gin Val Tyr Glu Thr Ala Glu Glu Leu 
165 170 175 

Asp Gin Asn Ser Gly Gin Thr Pro Gly He Pro Thr Gly Tyr Arg Asp 
180 185 190 

Leu Asp Gin Met Thr Ala Gly Phe Asn Arg Asn Asp Leu He lie Leu 
195 200 205 

Ala Ala Arg Pro Ser Val Gly Lys Thr Ala Phe Ala Leu Asn lie Ala 
210 215 220 

Gin Lys Val Ala Thr His Glu Asp Met Tyr Thr Val Gly lie Phe Ser 
225 230 235 240 

Leu Glu Met Gly Ala Asp Gin Leu Ala Thr Arg Met He Cys Ser Ser 
245 250 255 

Gly Asn Val Asp Ser Asn Arg Leu Arg Thr Gly Thr Met Thr Glu Glu 
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260 265 270 

Asp Trp Ser Arg Phe Thr lie Ala Val Gly Lys Leu Ser Arg Thr Lys 
275 280 285 

lie Phe lie Asp Asp Thr Pro Gly lie Arg lie Asn Asp Leu Arg Ser 
290 295 300 

Lys Cys Arg Arg Leu Lys Gin Glu His Gly Leu Asp Met lie Val He 
305 310 315 320 

Asp Tyr Leu Gin Leu He Gin Gly Ser Gly Ser Arg Ala Ser Asp Asn 
325 330 335 

Arg Gin Gin Glu Val Ser Glu lie Ser Arg Thr Leu Lys Ala Leu Ala 
340 345 350 

Arg Glu Leu Lys Cys Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
355 360 365 

Val Glu Gin Arg Gin Asp Lys Arg Pro Met Met Ser Asp He Arg Glu 
370 375 380 

Ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Asn Arg Gly Gly Asp Glu Asp Asp Asp Asp Asp Gly 
405 410 415 

Gly Phe Glu Pro Gin Thr Asn Asp Glu Asn Gly Glu He Glu He He 
420 425 430 

He Ala Lys Gin Arg Asn Gly Pro Thr Gly Thr Val Lys Leu His Phe 
435 440 445 

Met Lys Gin Tyr Asn Lys Phe Thr Asp He Asp Tyr Ala His Ala Asp 
450 455 460 

Met Met 
465 



<210> 7 
<211> 4308 
<212> DNA 

<213> Staphylococcus aureus 
<400> 7 
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atgacagagc aacaaaaatt taaagtgctt gctgatcaaa ttaaaatttc aaatcaatta 60 
gatgctgaaa ttttaaattc aggtgaactg acacgtatag atgtttctaa caaaaacaga 120 
acatgggaat ttcatattac attaccacaa ttcttagctc atgaagatta tttattattt 180 
ataaatgcaa tagagcaaga gtttaaagat atcgccaacg ttacatgtcg ttttacggta 240 
acaaatggca cgaatcaaga tgaacatgca attaaatact ttgggcactg tattgaccaa 300 
acagctttat ctccaaaagt taaaggtcaa ttgaaacaga aaaagcttat tatgtctgga 3 60 
aaagtattaa aagtaatggt atcaaatgac attgaacgta atcattttga taaggcatgt 420 
aatggaagtc ttatcaaagc gtttagaaat tgtggttttg atatcgataa aatcatattc 480 
gaaacaaatg ataatgatca agaacaaaac ttagcttctt tagaagcaca tattcaagaa 540 
gaagacgaac aaagtgcacg attggcaaca gagaaacttg aaaaaatgaa agctgaaaaa 600 
gcgaaacaac aagataacaa cgaaagtgct gtcgataagt gtcaaattgg taagccgatt 660 
caaattgaaa atattaaacc aattgaatct attattgagg aagagtttaa agttgcaata 720 
gagggtgtca tttttgatat aaacttaaaa gaacttaaaa gtggtcgcca tatcgtagaa 780 
attaaagtga ctgactatac ggactcttta gttttaaaaa tgtttactcg taaaaacaaa 840 
gatgatttag aacattttaa agcgctaagt gttggtaaat gggttagggc tcaaggtcgt 900 
attgaagaag atacatttat tagagattta gttatgatga tgtctgatat tgaagagatt 960 
aaaaaagcga caaaaaaaga taaggctgaa gaaaagcgtg tagaattcca cttgcatact 1020 
gcaatgagcc aaatggatgg tatacccaat attggtgcgt atgttaaaca ggcagcagac 1080 
tggggacatc cagccattgc ggttacagac cataatgttg tgcaagcatt tccagatgct 1140 
cacgcagcag cggaaaaaca tggcattaaa atgatatacg gtatggaagg tatgttagtt 1200 
gatgatggtg ttccgattgc atacaaacca caagatgtcg tattaaaaga tgctacttat 1260 
gttgtgttcg acgttgagac aactggttta tcaaatcagt atgataaaat catcgagctt 1320 
gcagctgtga aagttcataa cggtgaaatc atcgataagt ttgaaaggtt tagtaatccg 1380 
catgaacgat tatcggaaac gattatcaat ttgacgcata ttactgatga tatgttagta 1440 
gatgcccctg agattgaaga agtacttaca gagtttaaag aatgggttgg cgatgcgata 1500 
ttcgtagcgc ataatgcttc gtttgatatg ggcttcatcg atacgggata tgaacgtctt 1560 
gggtttggac catcaacgaa tggtgttatc gatactttag aattatctcg tacgattaat 1620 
actgaatatg gtaaacatgg tttgaatttc ttggctaaaa aatatggcgt agaattaacg 1680 
caacatcacc gtgccattta tgatacagaa gcaacagctt acattttcat aaaaatggtt 1740 
caacaaatga aagaattagg cgtattaaat cataacgaaa tcaacaaaaa actcagtaat 1800 
gaagatgcat ataaacgtgc aagacctagt catgtcacat taattgtaca aaaccaacaa 1860 
ggtcttaaaa atctatttaa aattgtaagt gcatcattgg tgaagtattt ctaccgtaca 1920 
cctcgaattc cacgttcatt gttagatgaa tatcgtgagg gattattggt aggtacagcg 1980 
tgtgatgaag gtgaattatt tacggcagtt atgcagaagg accagagtca agttgaaaaa 2040 
attgccaaat attatgattt tattgaaatt caaccaccgg cactttatca agatttaatt 2100 
gatagagagc ttattagaga tactgaaaca ttacatgaaa tttatcaacg tttaatacat 2160 
gcaggtgaca cagcgggtat acctgttatt gcgacaggaa atgcacacta tttgtttgaa 2220 
catgatggta tcgcacgtaa aattttaata gcatcacaac ccggcaatcc acttaatcgc 2280 
tcaactttac cggaagcaca ttttagaact acagatgaaa tgttaaacga gtttcatttt 234 0 
ttaggtgaag aaaaagcgca tgaaattgtt gtgaaaaata caaacgaatt agcagatcga 2400 
attgaacgtg ttgttcctat taaagatgaa ttatacacac cgcgtatgga aggtgctaac 2460 
gaagaaatta gagaactaag ttatgcaaat gcgcgtaaac tgtatggtga agacctgcct 2520 
caaatcgtaa ttgatcgatt agaaaaagaa ttaaaaagta ttatcggtaa tggatttgcg 2580 
gtaatttact taatttcgca acgtttagtt aaaaaatcat tagatgatgg atacttagtt 2640 
ggttcccgtg gttcagtagg ttctagtttt gtagcgacaa tgactgagat tactgaagta 2700 
aacccgttac cgccacacta tatttgtccg aactgtaaaa cgagtgaatt tttcaatgat 2760 
ggttcagtag gatcaggatt tgatttacct gataagacgt gtgaaacttg tggagcgcca 2820 
cttattaaag aaggacaaga tattccgttt gaaacatttt taggatttaa gggagataaa 2880 
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gttcctgata tcgacttaaa ctttagtggt 
aaagtattat ttggtgagga taaagtattc 
aagactgctt ttggttatgt taaaggttat 
gctgaaatag atcgactcgt taaaggatgt 
ccagggggta ttattgtagt acctgattac 
tatcctgccg atgatcaaaa ttcagcatgg 
catgataatg tattaaaact tgatatactt 
cttcaagatt tatcaggaat tgatccaaaa 
cagatattta gtacacctga aagtttgggt 
ggtacatttg gggtaccaga attcggtaca 
aagccaacaa cattttctga attagttcaa 
tggttaggca atgctcaaga attaattaaa 
ggttgtcgtg atgatatcat ggtttattta 
tttaaaataa tggagtcagt acgtaaaggt 
atgaaagaaa atgaagtgcc agattggtat 
ttccctaaag cccatgcagc agcatacgtt 
gtacatcatc cactttatta ctatgcatct 
ttaatcacga tgattaaaga taaaacaagc 
cgctatatgg atctaggtaa aaaagaaaaa 
gaaatggcgc atcgaggtta tcgaatgcaa 
gaatttatca ttgaaggcga tacacttatt 
gaaaacgttg cgaaacgaat tgttgaagct 
gatttaaaca aaaaagctgg attatctcag 
tcattaccga atttaccaga taaagctcaa 



gaatatcaac cgaatgccca taactacaca 2940 
cgtgcaggta caattggtac tgttgctgaa 3000 
ttgaatgatc aaggtatcca caaaagaggt 3060 
acaggtgtta aacgtacaac tggacagcat 312 0 
atggatattt atgattttac gccgatacaa 3180 
atgacgacac attttgattt ccattctatt 3240 
ggacacgatg atccaacaat gattcgtatg 3300 
acaatacctg tagatgataa agaagttatg 3350 
gttactgaag atgaaatttt atgtaaaaca 3420 
ggattcgtgc gtcaaatgtt agaagataca 3480 
atctcaggat tatctcatgg tacagatgtg 3540 
accggtatat gtgatttatc aagtgtaatt 3600 
atgtatgctg gtttagaacc atcaatggct 3660 
aaaggtttaa ctgaagaaat gattgaaacg 372 0 
ttagattcat gtcttaaaat taagtacatg 3780 
ttaatggcag tacgtatcgc atatttcaaa 3840 
tactttacaa ttcgtgcgtc agactttgat 3900 
attcgaaata ctgtaaaaga catgtattct 3960 
gacgtattaa cagtcttgga aattatgaat 4020 
ccgattagtt tagaaaagag tcaggcgttc 4080 
ccgccgttca tatcagtgcc tgggcttggc 4140 
cgtgacgatg gcccattttt atcaaaagaa 4200 
aaaattattg agtatttaga tgagttaggc 4260 
ctttcgatat ttgatatg 4308 



<210> 8 
<211> 1435 
<212> PRT 

<213> Staphylococcus aureus 
<400> 8 

Met Thr Glu Gin Gin Lys Phe Lys Val Leu Ala Asp Gin lie Lys lie 
15 10 15 

Ser Asn Gin Leu Asp Ala Glu lie Leu Asn Ser Gly Glu Leu Thr Arg 
20 25 30 

lie Asp Val Ser Asn Lye Asn Arg Thr Trp Glu Phe His lie Thr Leu 
35 40 45 

Pro Gin Phe Leu Ala His Glu Asp Tyr Leu Leu Phe lie Asn Ala lie 
50 55 60 

Glu Gin Glu Phe Lye Asp He Ala Asn Val Thr Cys Arg Phe Thr Val 
65 70 75 80 

Thr Asn Gly Thr Asn Gin Asp Glu His Ala He Lys Tyr Phe Gly Hie 
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85 

Cye lie Asp Gin Thr 
100 

Gin Lys Lys Leu He 
115 

Aan Asp He Glu Arg 
130 

He Lys Ala Phe Arg 
145 

Glu Thr Asn Asp Asn 
165 

His He Gin Glu Glu 
180 

Leu Glu Lys Met Lys 
195 

Ser Ala Val Asp Lys 
210 

He Lys Pro He Glu 
225 

Glu Gly Val He Phe 
245 

His He Val Glu He 
260 

Lys Met Phe Thr Arg 
275 

Leu Ser Val Gly Lys 
290 

Thr Phe He Arg Asp 
305 

Lys Lys Ala Thr Lys 
325 

His Leu His Thr Ala 



90 

Ala Leu Ser Pro Lys Val 
105 

Met Ser Gly Lys Val Leu 
120 

Asn His Phe Asp Lys Ala 
135 

Asn Cys Gly Phe Asp He 
150 155 

Asp Gin Glu Gin Asn Leu 
170 

Asp Glu Gin Ser Ala Arg 
185 

Ala Glu Lys Ala Lys Gin 
200 

Cys Gin He Gly Lys Pro 
215 

Ser He He Glu Glu Glu 
230 235 

Asp He Asn Leu Lys Glu 
250 

Lys Val Thr Asp Tyr Thr 
265 

Lys Asn Lys Asp Asp Leu 
280 

Trp Val Arg Ala Gin Gly 
295 

Leu Val Met Met Met Ser 
310 315 

Lys Asp Lys Ala Glu Glu 
330 

Met Ser Gin Met Asp Gly 



95 

Lys Gly Gin Leu Lys 
110 

Lys Val Met Val Ser 
125 

Cys Asn Gly Ser Leu 
140 

Asp Lys He He Phe 
160 

Ala Ser Leu Glu Ala 
175 

Leu Ala Thr Glu Lys 
190 

Gin Asp Asn Lys Gin 
205 

He Gin He Glu Asn 
220 

Phe Lys Val Ala He 
240 

Leu Lys Ser Gly Arg 
255 

Asp Ser Leu Val Leu 
270 

Glu His Phe Lys Ala 
285 

Arg He Glu Glu Asp 
300 

Asp He Glu Glu He 
320 

Lye Arg Val Glu Phe 
335 

He Pro Asn He Gly 
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Ala Tyr Val Lys Gin Ala Ala Asp Trp Gly His Pro Ala lie Ala Val 
355 360 365 

Thr Asp His Asn Val Val Gin Ala Phe Pro Asp Ala His Ala Ala Ala 
370 375 380 

Glu Lye His Gly lie Lys Met He Tyr Gly Met Glu Gly Met Leu Val 
385 390 395 400 

Asp Asp Gly Val Pro lie Ala Tyr Lys Pro Gin Asp Val Val Leu Lys 
405 410 415 

Asp Ala Thr Tyr Val Val Phe Asp Val Glu Thr Thr Gly Leu Ser Asn 
420 425 430 

Gin Tyr Asp Lys He He Glu Leu Ala Ala Val Lys Val His Asn Gly 
435 440 445 

Glu He He Asp Lys Phe Glu Arg Phe Ser Asn Pro His Glu Arg Leu 
450 455 460 

Ser Glu Thr He He Asn Leu Thr His He Thr Asp Asp Met Leu Val 
465 470 475 480 

Asp Ala Pro Glu He Glu Glu Val Leu Thr Glu Phe Lys Glu Trp Val 
485 490 495 

Gly Asp Ala He Phe Val Ala His Asn Ala Ser Phe Asp Met Gly Phe 
500 505 510 

He Asp Thr Gly Tyr Glu Arg Leu Gly Phe Gly Pro Ser Thr Asn Gly 
515 520 525 

Val He Asp Thr Leu Glu Leu Ser Arg Thr He Asn Thr Glu Tyr Gly 
530 535 540 

Lys His Gly Leu Asn Phe Leu Ala Lys Lys Tyr Gly Val Glu Leu Thr 
545 550 555 560 

Gin His His Arg Ala He Tyr Asp Thr Glu Ala Thr Ala Tyr He Phe 
565 570 575 

He Lys Met Val Gin Gin Met Lys Glu Leu Gly Val Leu Asn His Asn 
580 585 590 

Glu He Asn Lys Lys Leu Ser Asn Glu Asp Ala Tyr Lys Arg Ala Arg 
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Pro Ser His Val Thr Leu lie Val Gin Aesn Gin Gin Gly Leu Lys Asn 
610 615 620 

Leu Phe Lys lie Val Ser Ala Ser Leu Val Lys Tyr Phe Tyr Arg Thr 
625 630 635 640 

Pro Arg lie Pro Arg Ser Leu Leu Asp Glu Tyr Arg Glu Gly Leu Leu 
645 650 655 

Val Gly Thr Ala Cys Asp Glu Gly Glu Leu Phe Thr Ala Val Met Gin 
660 665 670 

Lys Asp Gin Ser Gin Val Glu Lys lie Ala Lys Tyr Tyr Asp Phe He 
675 680 685 

Glu He Gin Pro Pro Ala Leu Tyr Gin Asp Leu lie Asp Arg Glu Leu 
690 695 700 

He Arg Asp Thr Glu Thr Leu His Glu He Tyr Gin Arg Leu He His 
705 710 715 720 

Ala Gly Asp Thr Ala Gly He Pro Val He Ala Thr Gly Asn Ala His 
725 730 735 

Tyr Leu Phe Glu His Asp Gly He Ala Arg Lys He Leu He Ala Ser 
740 745 750 

Gin Pro Gly Asn Pro Leu Asn Arg Ser Thr Leu Pro Glu Ala His Phe 
755 760 765 

Arg Thr Thr Asp Glu Met Leu Asn Glu Phe His Phe Leu Gly Glu Glu 
770 775 780 

Lys Ala His Glu He Val Val Lys Asn Thr Asn Glu Leu Ala Asp Arg 
785 790 795 800 

He Glu Arg Val Val Pro He Lys Asp Glu Leu Tyr Thr Pro Arg Met 
805 810 815 

Glu Gly Ala Asn Glu Glu He Arg Glu Leu Ser Tyr Ala Asn Ala Arg 
820 825 830 

Lys Leu Tyr Gly Glu Asp Leu Pro Gin He Val He Asp Arg Leu Glu 
835 840 845 



Lys Glu Leu Lys Ser He He Gly Asn Gly Phe Ala Val He Tyr Leu 
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850 855 860 

He Ser Gin Arg Leu Val Lys Lys Ser Leu Asp Asp Gly Tyr Leu Val 
865 870 875 880 

Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr Met Thr Glu 
885 890 895 

lie Thr Glu Val Asn Pro Leu Pro Pro His Tyr He Cys Pro Abii Cys 
900 905 910 

Lys Thr Ser Glu Phe Phe Asn Asp Gly Ser Val Gly Ser Gly Phe Asp 
915 920 925 

Leu Pro Asp Lys Thr Cys Glu Thr Cys Gly Ala Pro Leu He Lys Glu 
930 935 940 

Gly Gin Asp He Pro Phe Glu Lys Phe Leu Gly Phe Lys Gly Asp Lys 
945 950 955 960 

Val Pro Asp He Asp Leu Asn Phe Ser Gly Glu Tyr Gin Pro Asn Ala 
965 970 975 

His Asn Tyr Thr Lys Val Leu Phe Gly Glu Asp Lys Val Phe Arg Ala 
980 985 990 

Gly Thr He Gly Thr Val Ala Glu Lys Thr Ala Phe Gly Tyr Val Lys 
995 1000 1005 

Gly Tyr Leu Asn Asp Gin Gly He His Lys Arg Gly Ala Glu He Asp 
1010 1015 1020 

Arg Leu Val Lys Gly Cys Thr Gly Val Lys Ala Thr Thr Gly Gin His 
1025 1030 1035 1040 

Pro Gly Gly He He Val Val Pro Asp Tyr Met Asp He Tyr Asp Phe 
1045 1050 1055 

Thr Pro He Gin Tyr Pro Ala Asp Asp Gin Asn Ser Ala Trp Met Thr 
1060 1065 1070 

Thr His Phe Asp Phe His Ser He His Asp Asn Val Leu Lys Leu Asp . 
1075 1080 1085 

He Leu Gly His Asp Asp Pro Thr Met He Arg Met Leu Gin Asp Leu 
1090 1095 1100 

Ser Gly He Asp Pro Lys Thr He Pro Val Asp Asp Lys Glu Val Met 

18 
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1105 1110 1115 1120 

Gin lie Phe Ser Thr Pro Glu Ser Leu Gly Val Thr Glu Asp Glu lie 
1125 1130 1135 

Leu Cys Lys Thr Gly Thr Phe Gly Val Pro Asn Ser Asp Arg He Arg 
1140 1145 1150 

Arg Gin Met Leu Glu Asp Thr Ly6 Pro Thr Thr Phe Ser Glu Leu Val 
1155 1160 1165 

Gin He Ser Gly Leu Ser His Gly Thr Asp Val Trp Leu Gly Asn Ala 
1170 1175 1180 

Gin Glu Leu lie Lys Thr Gly He Cys Asp Leu Ser Ser Val lie Gly 
1185 1190 1195 1200 

Cys Arg Asp Asp He Met Val Tyr Leu Met Tyr Ala Gly Leu Glu Pro 
1205 1210 1215 

Ser Met Ala Phe Lye He Met Glu Ser Val Arg Lys Gly Lys Gly Leu 
1220 1225 1230 

Thr Glu Glu Met He Glu Thr Met Lys Glu Asn Glu Val Pro Asp Trp 
1235 1240 1245 

Tyr Leu Asp Ser Cys Leu Lys He Lys Tyr He Phe Pro Lys Ala His 
1250 1255 1260 

Ala Ala Ala Tyr Val Leu Met Ala Val Arg He Ala Tyr Phe Lys Val 
1265 1270 1275 1280 

His His Pro Leu Tyr Tyr Tyr Ala Ser Tyr Phe Thr He Arg Ala Ser 
1285 1290 1295 

Asp Phe ABp Leu He Thr Met lie Lys Asp Lys Thr Ser He Arg Asn 
1300 1305 1310 

Thr Val Lys Asp Met Tyr Ser Arg Tyr Met Asp Leu Gly Lys Lys Glu 
1315 1320 1325 

Lys Asp Val Leu Thr Val Leu Glu He Met Asn Glu Met Ala His Arg 
1330 1335 1340 

Gly Tyr Arg Met Gin Pro He Ser Leu Glu Lys Ser Gin Ala Phe Glu 
1345 1350 1355 1360 

Phe He He Glu Gly Asp Thr Leu He Pro Pro Phe He Ser Val Pro 
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1365 1370 1375 

Gly Leu Gly Glu Asn Val Ala Lys Arg lie Val Glu Ala Arg Asp Asp 
1380 1385 1390 

Gly Pro Phe Leu Ser Lys Glu Asp Leu Aen Lys Lys Ala Gly Leu Tyr 
1395 1400 1405 

Gin Lys lie lie Glu Tyr Leu Asp Glu Leu Gly Ser Leu Pro Asn Lau 
1410 1415 1420 

Pro Asp Lys Ala Gin Leu Ser lie Phe Asp Met 
1425 1430 1435 



<210> 9 
<211> 1134 
<212> DNA 

<213> Staphylococcus aureus 
<400> 9 

atgatggaat tcactattaa aagagattat 

gctatttcac caagaacaac attacctata 

catgaagtta tattaactgg ttcagactct 

actgtagatg gcgaagatat tgtcaatatt 

cgattctttg ttgatattat aaaaaaatta 

gaacaattcc agacattaat tacatcaggt 

ccagatcaat atcctttatt acctcaagtt 

aaagtactta aaaacgtgat tgcacaaacg 

ccagtactaa ctggtgtgaa ctggcttata 

gattcacacc gcttggctgt aagaaagttg 

gtcatcattc caggtaaggc tttagctgaa 

gacattgata tcttctttgc ttcaaaccaa 

atttctcgat tattagaagg acattatcct 

gaaattaaat taagtataga caatggggag 

ttagcacgtg aaggtggtaa taacgttatt 

ttatcttcta catcaccaga aattggtact 

gaaggtggta gcctgaaaat ttcattcaac 

atcgataatg atgaggttga agttgaattc 

ccaaaaggtg acgactcggt aacgcaatta 



tttattacac 


aattaaatga 


cacattaaaa 


60 


ttaactggta 


tcaaaatcga 


tgcgaaagaa 


120 


gaaatttcaa 


tagaaatcac 


tattcctaaa 


180 


tcagaaacag 


gctcagtagt 


acttcctgga 


240 


cctggtaaag 


atgttaaatt 


atctacaaat 


300 




ttaatttgag 


tggcttagat 


360 


tctagagatg 


acgcaattca 


attgtcggta 


420 


aattttgcag 


tgtccacctc 


agaaacacgc 


480 


caagaaaatg 


aattaatatg 


cacagcgact 


540 


cagttagaag 


atgtttctga 




600 


ttaaataaaa 


ttatgtctga 


caatgaagaa 


660 


gttttattta 


aagttggaaa 


tgtgaacttt 


720 


gatacaacac 


gtttattccc 


tgaaaactat 


780 


ttttatcatg 


cgattgatcg 


tgcctcttta 


840 


aaattaagta 


caggtgatga 


cgttgttgaa 


900 


gtaaaagaag 


aagttgatgc 


aaacgatgtt 


960 


tctaaatata 


tgatggatgc 


tttaaaagca 


1020 


ttcggtacaa 


tgaaaccatt 


tattctaaaa 


1080 


attttaccaa 


tcagaactta 


ctaa 


1134 



<210> 10 
<211> 377 
<212> PRT 

<213> Staphylococcus aureus 
<400> 10 



20 
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Met Met Glu Phe Thr 
1 5 

Asp Thr Leu Lys Ala 

20 

Gly lie Lys lie Asp 
35 

Asp Ser Glu lie Ser 
50 

Glu Asp lie Val Asn 
65 

Arg Phe Phe Val Asp 
85 

Leu Ser Thr Asn Glu 
100 

Glu Phe Asn Leu Ser 
115 

Gin Val Ser Arg Asp 
130 



lie Lys Arg Asp Tyr Phe 
10 

lie Ser Pro Arg Thr Thr 
25 

Ala Lys Glu His Glu Val 
40 

lie Glu lie Thr lie Pro 
55 

lie Ser Glu Thr Gly Ser 
70 75 

lie lie Ly6 Lys Leu Pro 
90 

Gin Phe Gin Thr Leu lie 
105 

Gly Leu Asp Pro Asp Gin 
120 

Asp Ala lie Gin Leu Ser 
135 



He Thr Gin Leu Asn 
15 

Leu Pro He Leu Thr 
30 

lie Leu Thr Gly Ser 
45 

Lys Thr Val Asp Gly 
GO 

Val Val Leu Pro Gly 
80 

Gly Lys Asp Val Lys 
95 

Thr Ser Gly His Ser 
110 

Tyr Pro Leu Leu Pro 
125 

Val Lys Val Leu Lys 
140 



Asn Val lie Ala Gin Thr Asn Phe 
145 150 

Pro Val Leu Thr Gly Val Asn Trp 
165 

Cys Thr Ala Thr Asp Ser His Arg 
180 

Glu Asp Val Ser Glu Asn Lys Asn 
195 200 

Ala Glu Leu Asn Lys He Met Ser 
210 215 

Phe Phe Ala Ser Asn Gin Val Leu 
225 230 

He Ser Arg Leu Leu Glu Gly His 
245 



Ala Val Ser Thr Ser Glu Thr Arg 
155 160 

Leu He Gin Glu Asn Glu Leu He 
170 175 

Leu Ala Val Arg Lys Leu Gin Leu 
185 190 

Val He He Pro Gly Lys Ala Leu 
205 

Asp Asn Glu Glu Asp He Asp He 
220 

Phe Lys Val Gly Asn Val Asn Phe 
235 240 

Tyr Pro Asp Thr Thr Arg Leu Phe 
250 255 
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Pro Glu Asn Tyr Glu lie Lys Leu Ser lie Asp Asn Gly Glu Phe Tyr 
260 265 270 

Hie Ala lie Asp Arg Ala Ser Leu Leu Ala Arg Glu Gly Gly Asn Asn 
275 280 285 

Val lie Lys Leu Ser Thr Gly Asp Asp Val Val Glu Leu Ser Ser Thr 
290 295 300 

Ser Pro Glu He Gly Thr Val Lys Glu Glu Val Asp Ala Asn Asp Val 
305 310 315 320 

Glu Gly Gly Ser Leu Lys He Ser Phe Asn Ser Lys Tyr Met Met Asp 
325 330 335 

Ala Leu Lys Ala He Asp Asn Asp Glu Val Glu Val Glu Phe Phe Gly 
340 345 350 

Thr Met Lys Pro Phe lie Leu Lys Pro Lys Gly Asp Asp Ser Val Thr 
355 360 365 

Gin Leu He Leu Pro He Arg Thr Tyr 
370 375 



<210> 11 
<211> 930 
<212> DNA 

<213> Staphylococcus aureus 
<400> 11 

atggatgaac agcaacaatt gacgaatgca 
ttatttgaag gtgatgatgc acaaacgatg 
attttatgtc aaacagatag tcaatgtgaa 
tttatgtata tatcaacaac tgagaatgca 
cgtcatatga atcaacttcc tatagaaagc 
gaagactttg aaaagttaac tgttcaaggg 
ccaccggaca atacgattgc tattttattg 
atccattcaa ggtgtcagca tgtatatttc 
agattagttg aacaaaacat gtctaagcca 
caaatagata atgcaatggc tttaaatgaa 
gttatacgtt gggaattgtt gcttactaat 
ttattgaaac aggctaaaaa taaaaaactg 
ttcttcgaag atatcataca tacaaaggta 
ttaaaaaatg atattgatca atatgcgcaa 
tttgatcaac tgacggaagc acataagaaa 
tttgaacaaa tcgtaattaa gggtgtgagt 



tatcattcaa 


ataaattatc 


gcatgcctat 


60 


aaacaagttg 


cgattaattt 


tgcaaagctt 


120 


acaaaggtta 


gtacatataa 


tcatccagac 


180 


attaagaaag 


aacaagttga 


acaacttgtg 


240 


acaaataaag 


tgtacatcat 


cgaagacttt 


300 


gaaaacagta 


tcttgaaatt 


tcttgaagaa 


360 


tctacaaaac 


ctgagcaaat 


tttagacaca 


420 


aagcctattg 


ataaagaaaa 


gtttataaat 


480 


gtagctgaaa 


tgattagtac 


ttatactacg 


540 


gaatttgatt 


tattagcatt 


aaggaaatca 


600 


aagccaatgg 


cacttatagg 


tattattgat 


660 


caatctttaa 


ctattgcagc 


tgtgaatggt 


720 


aatgtagagg 


ataaacaaat 


atatagtgat 


780 


aagttgtcgt 


ttaatcaatt 


aattttgatg 


840 


ttgaatcaaa 


atgtaaatcc 


aacgcttgta 


900 








930 
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<210> 12 
<211> 310 
<212> PRT 

<213> Staphylococcus aureus 
<400> 12 

Met Asp Glu Gin Gin Gin Leu Thr Asn Ala Tyr His Ser Asn Lys Leu 
15 10 15 

Ser His Ala Tyr Leu Phe Glu Gly Asp Asp Ala Gin Thr Met Lys Gin 
20 25 30 

Val Ala He Asn Phe Ala Lys Leu He Leu Cys Gin Thr Asp Ser Gin 
35 40 45 

Cys Glu Thr Lys Val Ser Thr Tyr Asn His Pro A6p Phe Met Tyr He 
50 55 60 

Ser Thr Thr Glu Asn Ala He Lys Lys Glu Gin Val Glu Gin Leu Val 



Arg His Met Asn Gin Leu Pro He Glu Ser Thr Asn Lys Val Tyr He 
85 90 95 

He Glu Asp Phe Glu Asp Phe Glu Lys Leu Thr Val Gin Gly Glu Asn 
100 105 110 

Ser He Leu Lys Phe Leu Glu Glu Pro Pro Asp Asn Thr He Ala He 
115 120 125 

Leu Leu Ser Thr Lys Pro Glu Gin He Leu Asp Thr He His Ser Arg 
130 135 140 

Cys Gin His Val Tyr Phe Lys Pro He Asp Lys Glu Lys Phe He Asn 
145 150 155 160 

Arg Leu Val Glu Gin Asn Met Ser Lys Pro Val Ala Glu Met He Ser 
165 170 175 

Thr Tyr Thr Thr Gin He Aep Asn Ala Met Ala Leu Asn Glu Glu Phe 
180 185 190 

Asp Leu Leu Ala Leu Arg Lys Ser Val He Arg Trp Glu Leu Leu Leu 
195 200 205 

Thr Asn Lys Pro Met Ala Leu He Gly He He Asp Leu Leu Lys Gin 
210 215 220 



23 



WO 01/09164 



PCT/L'SOO/20666 



Ala Lye Asn Lys Lye Leu Gin Ser Leu Thr lie Ala Ala Val Asn Gly 
225 230 235 240 

Phe Phe Glu Asp lie He His Thr Lys Val Asn Val Glu Asp Lys Gin 
245 250 255 

He Tyr Ser Asp Leu Lys Asn Asp He Asp Gin Tyr Ala Gin Lys Leu 
260 265 270 

Ser Phe Asn Gin Leu He Leu Met Phe Asp Gin Leu Thr Glu Ala His 
275 280 285 

Lys Lys Leu Asn Gin Asn Val Asn Pro Thr Leu Val Phe Glu Gin He 
290 295 300 

Val He Lys Gly Val Ser 
305 310 



<210> 13 
<211> 744 
<212> DNA 

<213> Staphylococcus aureus 



<400> 13 














atgagcgaca 


atattgtagc 


tatttatgga 


gatgtgcctg 


aattggttga 


aaaacaaagt 


60 


gcagaaatca 


tatcacaatt 


tttgaaaagt 


gatagagatg 


actttaactt 


tgtgaaatat 


12 0 


aatttatacg 


aaacagagat 


tgcaccaatt 


gttgaagaaa 


cattaacatt 


gcctttcttt 


180 


tcagataaaa 


aagcaatttt 


ggttaaaaat 




ttacaggtga 


aaaagcgcca 


240 


aaagatatgg 


ctcataatgt 


agaccaatta 


atagaattta 


ttgaaaaata 


tgatggcgaa 


300 


aatttgattg 


tctttgagat 


atatcaaaat 


aaacttgatg 


aaagaaaaaa 


gttaactaaa 


360 


actctaaaaa 


agcatgcaag 


gcttaaaaaa 


atagagcaga 


tgtcggagga 


gatcaagtgg 


420 


attcaaaaaa 


aagaacaagc 


gattgatttt 


gtaaaagatc 


ttataacaat 


gaaagaagaa 


480 


ccaattaaac 


ttcttgcact 


tacatcaaat 


tatagacttt 


tttatcaatg 


taaaattctt 


540 




gttatagtgg 


tcaacaaatt 


gcaaaaacaa 


taggtgttca 


tccatataga 


600 


gtgaaacttg 


cacttggtca 


agtgagacat 


tatcaacttg 


atgaacttct 


taatattatt 


660 


gatgcatgtg 


cagaaacaga 


ttataaactt 


aaatcatcat 


atatggataa 


acaacttatt 


720 


cttgaacttt 


ttattctttc 


actt 








744 



<210> 14 
<211> 248 
<212> PRT 

<213> Staphylococcus aureus 
<400> 14 

Met Ser Asp Asn He Val Ala He Tyr Gly Asp Val Pro Glu Leu Val 
24 
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15 10 15 

Glu Lys Gin Ser Ala Glu lie He Ser Gin Phe Leu Lys Ser Asp Arg 
20 25 30 

Asp Aap Phe Asn Phe Val Lys Tyr Asn Leu Tyr Glu Thr Glu He Ala 
35 40 45 

Pro He Val Glu Glu Thr Leu Thr Leu Pro Phe Phe Ser Asp Lys Lys 
50 55 60 

Ala He Leu Val Lys Asn Ala Tyr He Phe Thr Gly Glu Lys Ala Pro 



Lys Asp Met Ala His Asn Val Asp Gin Leu He Glu Phe He Glu Lys 
85 90 95 

Tyr Asp Gly Glu Asn Leu He Val Phe Glu He Tyr Gin Asn Lys Leu 
100 105 no 

Asp Glu Arg Lys Lys Leu Thr Lys Thr Leu Lys Lys Hie Ala Arg Leu 
115 120 125 

Lys Lys He Glu Gin Met Ser Glu Glu He Lys Trp He Gin Lys Lys 
130 135 140 

Glu Gin Ala He Asp Phe Val Lys Asp Leu He Thr Met Lye Glu Glu 
145 150 155 160 

Pro He Lys Leu Leu Ala Leu Thr Ser Asn Tyr Arg Leu Phe Tyr Gin 
165 170 175 

Cys Lys He Leu Ser Gin Lys Gly Tyr Ser Gly Gin Gin He Ala Lys 
180 185 190 

Thr He Gly Val His Pro Tyr Arg Val Lys Leu Ala Leu Gly Gin Val 
195 200 205 

Arg His Tyr Gin Leu Asp Glu Leu Leu Asn He He Asp Ala Cys Ala 
210 215 220 

Glu Thr Asp Tyr Lys Leu Lys Ser Ser Tyr Met Asp Lys Gin Leu He 
225 230 235 240 

Leu Glu Leu Phe He Leu Ser Leu 
245 
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<210> 15 
<211> 1719 
<212> DNA 

<213> Staphylococcus aureus 
<400> 15 

atgataggtt tgtgtccttt tcatgatgaa aagacacctt catttacagt ttctgaagat 60 
aaacaaatct gtcattgttt tggttgtaaa aaaggtggca atgtttttca atttactcaa 120 
gaaattaaag acatatcatt tgttgaagcg gttaaagaat taggtgatag agttaatgtt 180 
gctgtagata ttgaggcaac acaatctaac tcaaatgttc aaattgcttc tgatgattta 240 
caaatgattg aaatgcatga gttaatacaa gaattttatt attacgcttt aacaaagaca 300 
gtcgaaggcg aacaagcatt aacatactta caagaacgtg gttttacaga tgcgcttatt 360 
aaagagcgag gcattggctt tgcacccgat agctcacatt tttgtcatga ttttcttcaa 420 
aaaaagggtt acgatattga attagcatat gaagccggat tattatcacg taacgaagaa 480 
aatttcagtt attacgatag atttcgaaat cgtattatgt ttcctttgaa aaatgcgcaa 540 
ggaagaattg ttggatattc aggtcgaaca tataccggtc aagaaccaaa atacctaaat 600 
agtcctgaaa cgcctatctt tcaaaaaaga aagttgttat ataacttaga taaagcacgt 660 
aaatcaatta gaaaattaga tgaaattgta ttactagaag gttttatgga tgttataaaa 720 
tctgatactg ctggcttgaa aaacgttgtt gcaacaatgg gtacacagtt gtcagatgaa 780 
catattacct ttatacgaaa gttaacatca aatataacat taatgtttga tggggatttt 840 
gcgggtagtg aagcaacact taaaacaggt caacatttgt tacagcaagg gctaaatgta 900 
tttgttatac aattgccatc tggcatggat ccggatgaat acattggtaa gtatggcaac 960 
gacgcattta ctacttttgt aaaaaatgac aaaaagtcat ttgcacatta taaagtaagt 1020 
atattaaaag atgaaattgc acataatgac ctttcatatg aacgttattt gaaagaactg 1080 
agtcatgaca tttcacttat gaagtcatca attctgcaac aaaaggctat aaatgatgtt 1140 
gcgccatttt tcaatgttag tcctgagcag ttagctaacg aaatacaatt caatcaagca 1200 
ccagccaatt attatccaga agatgagtat ggcggttatg atgagtatgg cggttatatt 1260 
gaacctgagc caattggtat ggcacaattt gacaatttga gccgtcgaga aaaagcggag 1320 
cgagcatttt taaaacattt aatgagagat aaagatacat ttttaaatta ttatgaaagt 1380 
gttgataagg ataacttcac aaatcagcat tttaaatatg tattcgaagt cttacatgat 1440 
ttttatgcgg aaaatgatca atataatatc agtgatgctg tgcagtatgt taattcaaat 1500 
gagttgagag aaacactaat tagcttagaa caatataatt tgaatggcga accatatgaa 1560 
aatgaaattg atgattatgt caatgttatt aatgaaaaag gacaagaaac aattgagtca 1620 
ttgaatcata aattaaggga agctacaagg attggcgatg tagaattaca aaaatactat 1680 
ttacagcaaa ttgttgctaa gaataaagaa cgcatgtag 1719 



<210> 16 
<211> 572 
<212> PRT 

<213> Staphylococcus aureus 
<400> 16 

Met He Gly Leu Cys Pro Phe His Asp Glu Lys Thr Pro Ser Phe Thr 
15 10 is 



Val Ser Glu Asp Lys Gin He Cys His Cys Phe Gly Cys Lys Lys Gly 
20 25 30 
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Gly Aen Val Phe Gin Phe Thr Gin Glu He Lys Asp lie Ser Phe Val 
35 40 45 

Glu Ala Val Lye Glu Leu Gly Asp Arg Val Asn Val Ala Val Asp lie 
50 55 60 

Glu Ala Thr Gin Ser Asn Ser Asn Val Gin He Ala Ser Asp Asp Leu 
65 70 75 80 

Gin Met He Glu Met His Glu Leu He Gin Glu Phe Tyr Tyr Tyr Ala 
85 90 95 

Leu Thr Lys Thr Val Glu Gly Glu Gin Ala Leu Thr Tyr Leu Gin Glu 
100 105 no 

Arg Gly Phe Thr Asp Ala Leu He Lys Glu Arg Gly He Gly Phe Ala 
115 120 125 

Pro Asp Ser Ser His Phe Cys His Asp Phe Leu Gin Lys Lys Gly Tyr 
130 135 140 

Asp He Glu Leu Ala Tyr Glu Ala Gly Leu Leu Ser Arg Asn Glu Glu 
145 150 155 160 

Asn Phe Ser Tyr Tyr Asp Arg Phe Arg Asn Arg He Met Phe Pro Leu 
165 170 175 

Lys Asn Ala Gin Gly Arg He Val Gly Tyr Ser Gly Arg Thr Tyr Thr 
180 185 190 

Gly Gin Glu Pro Lys Tyr Leu Asn Ser Pro Glu Thr Pro He Phe Gin 
195 200 205 

Lye Arg Lys Leu Leu Tyr Asn Leu Asp Lys Ala Arg Lys Ser He Arg 
210 215 220 

Lys Leu Asp Glu He Val Leu Leu Glu Gly Phe Met Asp Val He Lys 
225 230 235 240 

Ser Asp Thr Ala Gly Leu Lye Asn Val Val Ala Thr Met Gly Thr Gin 
245 250 255 

Leu Ser Asp Glu His He Thr Phe lie Arg Lys Leu Thr Ser Asn He 
260 265 270 



Thr Leu Met Phe Asp Gly Asp Phe Ala Gly Ser Glu Ala Thr Leu Lys 
275 280 285 
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Thr Gly Gin His Leu Leu Gin Gin Gly Leu Asn Val Phe Val lie Gin 
290 295 300 

Leu Pro Ser Gly Met Asp Pro Asp Glu Tyr lie Gly Lys Tyr Gly Asn 
305 310 315 320 

Asp Ala Phe Thr Thr Phe Val Lys Asn Asp Lys Lys Ser Phe Ala His 
325 330 335 

Tyr Lys Val Ser He Leu Lys Asp Glu He Ala His Asn Asp Leu Ser 
340 345 350 

Tyr Glu Arg Tyr Leu Lys Glu Leu Ser His Asp He Ser Leu Met Lys 
355 360 365 

Ser Ser He Leu Gin Gin Lys Ala He Asn Asp Val Ala Pro Phe Phe 
370 375 380 

Aen Val Ser Pro Glu Gin Leu Ala Asn Glu He Gin Phe Asn Gin Ala 
385 390 395 400 

Pro Ala Asn Tyr Tyr Pro Glu Asp Glu Tyr Gly Gly Tyr Asp Glu Tyr 
405 410 415 

Gly Gly Tyr He Glu Pro Glu Pro He Gly Met Ala Gin Phe Asp Aen 
420 425 430 

Leu Ser Arg Arg Glu LyB Ala Glu Arg Ala Phe Leu LyB His Leu Met 
435 440 445 

Arg Asp Lys Asp Thr Phe Leu Asn Tyr Tyr Glu Ser Val Asp Lys Asp 
450 455 460 

Asn Phe Thr Asn Gin His Phe Lys Tyr Val Phe Glu Val Leu Hie Asp 
465 470 475 480 

Phe Tyr Ala Glu Asn Asp Gin Tyr Asn He Ser Asp Ala Val Gin Tyr 
485 490 495 

Val Asn Ser Asn Glu Leu Arg Glu Thr Leu He Ser Leu Glu Gin Tyr 
500 505 510 

Asn Leu Asn Gly Glu Pro Tyr Glu Aen Glu He Asp Asp Tyr Val Asn 
S15 520 525 



Val He Asn Glu Lye Gly Gin Glu Thr He Glu Ser Leu Asn His Lys 
530 535 540 
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Leu Arg Glu Ala Thr Arg lie Gly Asp Val Glu Leu Gin Lys Tyr Tyr 
545 550 555 560 

Leu Gin Gin lie Val Ala Lye Asn Lys Glu Arg Met 
565 570 



<210> 17 
<211> 4395 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 17 

atgtcagatt tattcgctaa attgatggac 
tcaagtgcct tttcatctgc tgatattatc 
tgggaatttc attttgcctt tgcagcggtt 
gatcgtttga taagaacttt tgaggcggct 
gctcaggtgg attattcaga tgatctgctt 
gcaccgtgta atagtgctag ttttaaatct 
gatgacaaac tcattattgc agcgccaggt 
catctgccta atctggtcaa gcaattagaa 
atggtgtcag atcaggaaat gactgagcat 
gctcttgtga aaaaggctgt gcaggataat 
atgccaccag ttgaggaagc cacacctgct 
aagcgtcagg cagggtttga aaaagcaacc 
gaaaaccgga ttgtctttga gggtatggtt 
ggtcgccata tcatcaactt taaaatgaca 
tgggctaaag acgatgagga gctccgtaaa 
cgggtacaag ggaatattga gaccaatcct 
caggtcaaag aaattgtccg tcatgagcgc 
gtcgaacttc atgcccacac caatatgtct 
ttgattgata cggcagccaa gtggggacac 
gtgcaaagtt ttcctcatgg ctaccatagg 
ggcctagaag ccaatattgt tgaggacaag 
gatttgcacg aagccaccta tgtggtcttt 
aataatgacc tgattcagat tgcggcttcc 
tttgatgaat tcattgatcc tgggcatcct 
attaccgata agcatttgca gggcgccaag 
gacttttgca aagatagtat cttggttgcc 
aacgccaatt atgaacgcca cgacttgccc 
gaatttgcta gaaacttgta tcctgagtac 
cgtttccaag tgagtctaga ccaccatcat 
cgtcttttgt ttatttttct aaaagatgcc 
caactcaata cagatttggt ggctgaggat 
actatctatg tgcaaaatca ggttggtctt 
aatatcaaat attttgaagg ggtgccgcgt 
gagggtttgt tacttggtac agcttgttct 



cagatagaaa tgccacttga catgagacgt 60 
gaggtaaagg tacattcggt gtcacgcttg 120 
ttaccgattg caacttatcg tgaattgcat 180 
gacattaagg taacctttga catccaagct 240 
caagcttatt accaagaagc ttttgagcat 300 
tctttctcaa agctcaaagt gacttatgag 360 
tttgtgaata acgatcattt tagaaacaat 420 
gcctttggct ttggcatctt gaccatagat 480 
ttgaccaaga attttgtttc cagtcgtcag 540 
ttggaagccc aaaaatctct tgaagccatg 600 
cctaagtttg actacaagga acgagcagct 660 
atcacaccaa tgattgagat tgagaccgaa 72 0 
tttgacgtgg agcgtaaaac gactaggaca 780 
gactatacct cctcgtttgc tctccaaaaa 840 
tttgatatga ttgctaaggg agcttggtta 900 
tttacgaaga gtctcaccat gaatgtccag 960 
aaagacctga tgccagaagg gcaaaagcgg 102 0 
accatggatg ccttaccgac agtagaaagc 1080 
aaggcgattg ctatcaccga ccatgctaat 114 0 
gctcgcaaag ctgggattaa ggctattttt 1200 
gtgcctattt cttatgaacc tgttgatatg 1260 
gacgtggaaa ccacaggtct atctgctatg 1320 
aaaatgttta aaggaaatat tgtagagcag 13 80 
ctttcagcct ttaccaccga attgacagga 144 0 
ccattggtta ctgtcctaaa agcttttcag 1500 
cacaacgcca gttttgacgt gggctttatg 1560 
aaaatcacac agcctgtgat tgatacctta 1620 
aagcgtcacg gtttgggacc gctcaccaag 1680 
atggccaatt acgacgcgga agccacagga 1740 
agagaaaagc atggcatcaa aaatcttttg 1800 
tcttacaaaa aagcgcggat taagcatgcg 1860 
aaaaatatgt ttaagttggt cagcctttcc 1920 
attccaagaa ccgtcttaga tgctcacaga 1980 
gacggcgagg tttttgatgc cgttctgact 204 0 
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aaaggaattg atgcagcggt tgatttggct aggtattatg attttatcga aatcatgcca 2100 
ccagccattt accagccatt ggttgtccgt gaattaatca aagatcaagc aggtattgag 2160 
caggtgattc gtgacctcat tgaagtaggg aaacgagcta agaaacctgt gcttgccact 2220 
gggaatgtgc attatctaga gcctgaagaa gagatttacc gtgaaattat tgtgcgtagt 2280 
cttggtcagg gtgccatgat taatagaaca atcggccgtg gggaaggggc acagcctgct 2340 
cctctaccta aagcgcactt tagaacaacc aatgaaatgc tggatgagtt tgcctttctt 2400 
ggaaaagacc tcgcttatca agtagttgtg caaaatactc aggattttgc ggaccgtatt 2460 
gaggaagtgg aagtggttaa gggcgatctt tacaccccgt atattgataa ggccgaagag 2520 
acggttgccg aattaaccta tcaaaaagcc tttgaaattt atggtaatcc tctcccagat 2580 
attattgatt tacgcattga aaaagagtta acctctatct tggggaacgg ttttgctgtg 2640 
atttatctcg cttcccaaat gcttgttaac cggtcaaatg agcgaggcta cctagttggt 2700 
tctaggggat ctgtagggtc tagctttgtc gccaccatga ttgggattac tgaggttaat 27 60 
cctatgccgc ctcactacgt ttgcccgtcc tgccaacatt ctgaatttat cacagatggg 2820 
tcagttggat ctggctatga tttgcctaat aaaccctgtc cgaaatgtgg caccccttat 2880 
caaaaagatg ggcaagacat tccctttgag acctttcttg ggtttgatgg ggataaggtg 294 0 
cccgatattg atttgaactt ctctggtgat gaccagccca gtgcccattt ggatgtccga 3000 
gatatttttg gtgacgaata cgcctttcgt gctggaacag ttggtaccgt agcagaaaaa 3060 
acagcttatg gatttgtcaa aggctatgaa cgcgactatg gcaagttcta tcgtgatgct 3120 
gaggtggatc gtctagcagc aggtgctgct ggtgtgaaac gaacgactgg gcagcaccct 3180 
ggggggattg ttgttattcc taattacatg gatgtttatg attttacccc cgtgcaatat 3240 
ccagccgatg atgtaacggc ttcttggcag acaactcact ttaacttcca tgatattgat 3300 
gaaaacgtct tgaaacttga tatcctaggg catgatgatc cgaccatgat tcgtaaactt 3360 
caggatttat cgggcattga tcctattact attcctgctg atgatccggg agttatggct 3420 
ctcttttctg ggacagaggt tttgggcgtt accccggaac aaattgggac accgactggt 3480 
atgctaggca ttccagaatt tggaaccaac tttgttcgcg gcatggttaa tgagacgcat 3 54 0 
ccgaccactt ttgcggagct tttgcagttg tctggactat ctcatggaac cgatgtttgg 3600 
cttggtaatg cacaagattt gattaaagaa ggcattgcaa ccctaaaaac cgttatcggt 3660 
tgtcgtgacg acatcatggt ttacctcatg cacgcaggct tagaaccaaa aatggccttt 3720 
accattatgg agcgtgtgcg taagggatta tggctaaaaa tttctgagga agaacgtaat 3780 
ggctatattg atgccatgcg agaaaacaat gtgcccgact ggtacattga atcgtgtgga 3840 
aaaatcaagt acatgttccc taaagcccat gcggcagctt atgttttgat ggcccttcgg 3900 
gtggcttatt tcaaggtgca ccaccccatt atgtattatt gtgcttattt ctctattcgt 3960 
gcgaaggctt ttgaattaaa aaccatgagt ggtggtttag atgctgttaa agcaagaatg 4020 
gaagatatta ctataaaacg taaaaataat gaagccacca atgtggaaaa tgacctcttt 4080 
acaaccttgg agattgtcaa cgaaatgtta gaacgcggct ttaagtttgg caaattagac 4140 
ctttacaaaa gtgatgctat agaattccaa atcaaaggag atacccttat ccctccattt 4200 
atagcgctag aaggtctggg tgaaaacgtg gccaagcaaa tcgttaaagc tcgtcaagaa 4260 
ggcgaattcc tctctaaaat ggaattgcgt aaacgaggcg gggcatcgtc aacgctcgtt 4320 
gagaaaatgg atgagatggg tattttagga aatatgccag aagataatca attaagtctt 4380 
tttgatgact ttttc 4395 



<210> 18 
<211> 1465 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 18 
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Met Ser Asp Leu Phe Ala Lys Leu Met Asp Gin He Glu Met Pro Leu 
1 5 io 15 

Asp Met Arg Arg Ser Ser Ala Phe Ser Ser Ala Asp He He Glu Val 
20 25 30 

Lys Val Hie Ser Val Ser Arg Leu Trp Glu Phe His Phe Ala Phe Ala 
35 40 45 

Ala Val Leu Pro He Ala Thr Tyr Arg Glu Leu His Asp Arg Leu He 
50 55 60 

Arg Thr Phe Glu Ala Ala Asp He Lys Val Thr Phe Asp He Gin Ala 



Ala Gin Val Asp Tyr Ser Asp Asp Leu Leu Gin Ala Tyr Tyr Gin Glu 
85 90 95 

Ala Phe Glu His Ala Pro Cys Asn Ser Ala Ser Phe Lys Ser Ser Phe 
100 105 no 

Ser Lys Leu Lys Val Thr Tyr Glu Asp Asp Lys Leu He He Ala Ala 
115 120 125 

Pro Gly Phe Val Asn Asn Asp His Phe Arg Asn Aen His Leu Pro Asn 
130 135 140 

Leu Val Lys Gin Leu Glu Ala Phe Gly Phe Gly He Leu Thr He Asp 
"5 150 155 160 

Met Val Ser Asp Gin Glu Met Thr Glu His Leu Thr Lys Asn Phe Val 
165 170 175 

Ser Ser Arg Gin Ala Leu Val Lys Lys Ala Val Gin Asp Asn Leu Glu 
180 185 190 

Ala Gin Lys Ser Leu Glu Ala Met Met Pro Pro Val Glu Glu Ala Thr 
195 200 205 

Pro Ala Pro Lys Phe Asp Tyr Lys Glu Arg Ala Ala Lys Arg Gin Ala 
210 215 220 

Gly Phe Glu Lys Ala Thr He Thr Pro Met He Glu He Glu Thr Glu 
225 230 235 240 

Glu Asn Arg He Val Phe Glu Gly Met Val Phe Asp Val Glu Arg Lys 
245 250 255 
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Thr Thr Arg Thr Gly Arg Hie lie lie Asn Phe Lys Met Thr Asp Tyr 
260 265 270 

Thr Ser Ser Phe Ala Leu Gin Lys Trp Ala Lys Asp Asp Glu Glu Leu 
275 280 285 

Arg Lys Phe Asp Met lie Ala Lys Gly Ala Trp Leu Arg Val Gin Gly 
290 295 300 

Asn lie Glu Thr Asn Pro Phe Thr Lys Ser Leu Thr Met Asn Val Gin 
305 310 315 320 

Gin Val Lys Glu lie Val Arg His Glu Arg Lys Asp Leu Met Pro Glu 
325 330 335 

Gly Gin Lys Arg Val Glu Leu His Ala His Thr Asn Met Ser Thr Met 
340 345 350 

Asp Ala Leu Pro Thr Val Glu Ser Leu lie Asp Thr Ala Ala Lys Trp 
355 360 365 

Gly His Lys Ala He Ala He Thr Asp His Ala Asn Val Gin Ser Phe 
370 375 380 

Pro His Gly Tyr His Arg Ala Arg Lys Ala Gly He Lys Ala He Phe 
385 390 395 400 

Gly Leu Glu Ala Asn He Val Glu Asp Lys Val Pro He Ser Tyr Glu 
405 410 415 

Pro Val Asp Met Asp Leu His Glu Ala Thr Tyr Val Val Phe Asp Val 
420 425 430 

Glu Thr Thr Gly Leu Ser Ala Met Asn Asn Asp Leu He Gin He Ala 
435 440 445 

Ala Ser Lys Met Phe Lys Gly Asn He Val Glu Gin Phe Asp Glu Phe 
450 455 460 

He Asp Pro Gly His Pro Leu Ser Ala Phe Thr Thr Glu Leu Thr Gly 
465 470 475 480 

He Thr Asp Lys His Leu Gin Gly Ala Lys Pro Leu Val Thr Val Leu 
485 490 495 

Lys Ala Phe Gin Asp Phe Cys Lys Asp Ser He Leu Val Ala Hie Asn 
500 505 510 
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Ala Ser Phe Asp Val Gly Phe Met Asn Ala Abii Tyr Glu Arg His Asp 
515 520 525 

Leu Pro Lys lie Thr Gin Pro Val lie Asp Thr Leu Glu Phe Ala Arg 
530 535 540 

Asn Leu Tyr Pro Glu Tyr Lys Arg His Gly Leu Gly Pro Leu Thr Lys 
545 550 555 560 

Arg Phe Gin Val Ser Leu Asp Hie HiB Hie Met Ala Asn Tyr Asp Ala 
565 570 575 

Glu Ala Thr Gly Arg Leu Leu Phe lie Phe Leu Lys Asp Ala Arg Glu 
580 585 590 

Lys His Gly He Lys Asn Leu Leu Gin Leu Asn Thr Asp Leu Val Ala 
595 600 605 

Glu Asp Ser Tyr Lys Lys Ala Arg He Lys His Ala Thr He Tyr Val 
610 615 620 

Gin Asn Gin Val Gly Leu Lys Asn Met Phe Lys Leu Val Ser Leu Ser 
625 630 635 640 

Aon He Lys Tyr Phe Glu Gly Val Pro Arg He Pro Arg Thr Val Leu 
645 650 655 

Asp Ala His Arg Glu Gly Leu Leu Leu Gly Thr Ala Cys Ser Asp Gly 
660 665 670 

Glu Val Phe Asp Ala Val Leu Thr Lys Gly He Asp Ala Ala Val Asp 
675 680 685 

Leu Ala Arg Tyr Tyr Asp Phe He Glu He Met Pro Pro Ala He Tyr 
690 695 700 

Gin Pro Leu Val Val Arg Glu Leu He Lys Asp Gin Ala Gly He Glu 
705 710 715 720 

Gin Val He Arg Asp Leu He Glu Val Gly Lys Arg Ala Lys Lys Pro 
725 730 735 

Val Leu Ala Thr Gly Asn Val His Tyr Leu Glu Pro Glu Glu Glu He 
740 745 750 

Tyr Arg Glu He He Val Arg Ser Leu Gly Gin Gly Ala Met He Asn 
755 760 765 
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Arg Thr He Gly Arg Gly Glu Gly Ala Gin Pro Ala Pro Leu Pro Lys 
770 775 780 

Ala His Phe Arg Thr Thr Asn Glu Met Leu Asp Glu Phe Ala Phe Leu 
785 790 795 800 

Gly Lys Asp Leu Ala Tyr Gin Val Val Val Gin Asn Thr Gin Asp Phe 
805 810 815 

Ala Asp Arg He Glu Glu Val Glu Val Val Lys Gly Asp Leu Tyr Thr 
820 825 830 

Pro Tyr He Asp Lys Ala Glu Glu Thr Val Ala Glu Leu Thr Tyr Gin 
835 840 845 

Lys Ala Phe Glu He Tyr Gly Asn Pro Leu Pro Asp He He Asp Leu 
850 855 860 

Arg He Glu Lys Glu Leu Thr Ser He Leu Gly Asn Gly Phe Ala Val 
865 870 875 880 

He Tyr Leu Ala Ser Gin Met Leu Val Asn Arg Ser Asn Glu Arg Gly 
885 890 895 

Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr 
900 905 910 

Met He Gly He Thr Glu Val Asn Pro Met Pro Pro His Tyr Val Cys 
915 920 925 

Pro Ser Cys Gin His Ser Glu Phe He Thr Asp Gly Ser Val Gly Ser 
930 935 940 

Gly Tyr Asp Leu Pro Asn Lys Pro Cys Pro Lys Cys Gly Thr Pro Tyr 
945 950 955 960 

Gin Lys Asp Gly Gin Asp He Pro Phe Glu Thr Phe Leu Gly Phe Asp 
965 970 975 

Gly Asp Lys Val Pro Asp He Asp Leu Asn Phe Ser Gly Asp Asp Gin 
980 985 990 

Pro Ser Ala His Leu Asp Val Arg Asp He Phe Gly Asp Glu Tyr Ala 
995 1000 1005 

Phe Arg Ala Gly Thr Val Gly Thr Val Ala Glu Lys Thr Ala Tyr Gly 
1010 1015 1020 
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Phe Val Lys Gly Tyr Glu Arg Asp Tyr Gly LyB Phe Tyr Arg Asp Ala 
1025 1030 1035 1040 

Glu Val Asp Arg Leu Ala Ala Gly Ala Ala Gly Val Lys Arg Thr Thr 
1045 1050 1055 

Gly Gin His Pro Gly Gly lie Val Val lie Pro Asn Tyr Met Asp Val 
1060 1055 1070 

Tyr Asp Phe Thr Pro Val Gin Tyr Pro Ala Asp Asp Val Thr Ala Ser 
1075 1080 1085 

Trp Gin Thr Thr His Phe Asn Phe His Asp lie Asp Glu Asn Val Leu 
1090 1095 1100 

Lys Leu Asp lie Leu Gly His Asp Asp Pro Thr Met lie Arg Lys Leu 
1105 1110 1115 1120 

Gin Asp Leu Ser Gly lie Asp Pro lie Thr lie Pro Ala Asp Asp Pro 
1125 1130 1135 

Gly Val Met Ala Leu Phe Ser Gly Thr Glu Val Leu Gly Val Thr Pro 
1140 1145 1150 

Glu Gin lie Gly Thr Pro Thr Gly Met Leu Gly lie Pro Glu Phe Gly 
1155 1160 1165 

Thr Asn Phe Val Arg Gly Met Val Asn Glu Thr His Pro Thr Thr Phe 
1170 1175 1180 

Ala Glu Leu Leu Gin Leu Ser Gly Leu Ser His Gly Thr Asp Val Trp 
1185 1190 1195 1200 

Leu Gly Asn Ala Gin Asp Leu lie Lys Glu Gly lie Ala Thr Leu Lys 
1205 1210 1215 

Thr Val lie Gly Cys Arg Asp Asp lie Met Val Tyr Leu Met His Ala 
1220 1225 1230 

Gly Leu Glu Pro Lys Met Ala Phe Thr lie Met Glu Arg Val Arg Lys 
1235 1240 1245 

Gly Leu Trp Leu Lys lie Ser Glu Glu Glu Arg Asn Gly Tyr lie Asp 
1250 1255 1260 

Ala Met Arg Glu Asn Asn Val Pro Asp Trp Tyr lie Glu Ser Cys Gly 
1265 1270 1275 1280 
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Lys He Lys Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu 
1285 1290 1295 

Met Ala Leu Arg Val Ala Tyr Phe Lye Val His His Pro He Met Tyr 
1300 1305 1310 

Tyr Cys Ala Tyr Phe Ser lie Arg Ala Lys Ala Phe Glu Leu Lys Thr 
1315 1320 1325 

Met Ser Gly Gly Leu Asp Ala Val Lys Ala Arg Met Glu Asp He Thr 
1330 1335 1340 

He Lys Arg Lys Asn Asn Glu Ala Thr Asn Val Glu Asn Asp Leu Phe 
1345 1350 1355 1360 

Thr Thr Leu Glu He Val Asn Glu Met Leu Glu Arg Gly Phe Lys Phe 
1365 1370 1375 

Gly Lys Leu Asp Leu Tyr Lys Ser Asp Ala He Glu Phe Gin He Lys 
1380 1385 1390 



Gly Asp Thr Leu He Pro Pro Phe 
1395 1400 

Asn Val Ala Lys Gin He Val Lys 
1410 1415 

Ser Lys Met Glu Leu Arg Lys Arg 
1425 1430 

Glu Lys Met Asp Glu Met Gly He 
1445 



He Ala Leu Glu Gly Leu Gly Glu 
1405 

Ala Arg Gin Glu Gly Glu Phe Leu 
1420 

Gly Gly Ala Ser Ser Thr Leu Val 
1435 1440 

Leu Gly Asn Met Pro Glu Asp Asn 
1450 1455 



Gin Leu Ser Leu Phe Asp Asp Phe Phe 
1460 1465 



<210> 19 
<211> 3102 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 19 

atgtttgctc aacttgatac taaaactgta 
aatcattatt ttgaacgagc aaagcaattt 
gataatcttt atggtgctta ccattttatt 
gttttaggtt tggaaataga gattctctat 
gcccagaata cacaaggcta tcatcagctt 



tactcattta tggatagttt aattgactta 60 
ggttaccaca ccataggaat catggataag 12 0 
aaaggttgtc aaaaaaatgg actgcagcca 180 
caagagcggc aggtgctcct taacttaatc 240 
ttaaaaattt ccacggcaaa aatgtctggc 300 
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aagcttcata tggattactt ctgccaacat 
aagggttgga gcgatacatt agtggtccct 
actgatttat ctcatatgga ttctaagagg 
tttgcgcaag atgatatgga aaccctgcac 
ctggcagaga cccctgtggt agaaagtgat 
gccttctatc aaacacactg ccctcaagct 
atctattatg atttcgatac aaatttaaaa 
aagcaagaat tgcaagactt gactgaggct 
ccttatcaat cgcgcttact acatgaattg 
tattttttga ttgtgtggga tttacttcgc 
atgggacgtg gctcggcggc aggtagtcta 
gatccagttc aacatgattt gctatttgag 
cctgatattg atatcgatct tccagatatt 
aatcgttatg gtagcgacca ttcggcgcaa 
attcgtgatg ttttcaaacg gttcggggtt 
aaaattggtt ttaaagatag cttggctact 
gttattaata gtagaactga atttcaaaag 
aatccaagac aaacgtccat tcacgcagct 
aatcatattc ctctaaaatc gggcgatgac 
gtcgaagcta atggcctgtt aaaaatggat 
caaaaaatgc aagagaaggt tgctaaagac 
gatttagaag acccgcaaac gttggcactt 
caatttgaac aaaatggtgc tattaatctt 
gaaattgttg ccactaccag tctaaataga 
attaaacgaa gagaaggaca agaaaaaatt 
ttagagccaa cttacggtat tatgctttat 
tatgctggtt ttacgttagg caaggccgac 
ctacaagaaa tgcaaaaaat ggaagaagac 
gctgaagaaa cagctagagg actttttaaa 
aaccgcagcc atgcctttgc ctattcagct 
cattacccgg ctgtttttta cgatatcatg 
gatgctctag aatcagattt tcaagtagcg 
gataaaattg aagctagcaa gatttacatg 
gattttgctt attggattat cgagcaaaga 
agaactccag aaaaatatca aaaaaaggtt 
tttgattgct ttgagcctaa ccgtaaaaaa 
tttgttaatg agcttggttc tcttttttca 
gattactcag taactgaaaa atattctttg 
aagcatcctt taattgatat tgctgagaaa 
ttagtcaaag aaagcgaagc agtcgtactg 
accaaaacaa gtgggcagca aatggctttt 
gatgtcacac tttttccaca agagtatgcc 
ttctattact taaaaggtag aataaaagaa 
caagtgcaaa tggctattag tcaaaaatat 
tcccaaattt ctgagatttt aggtgccttt 
caaaaaaata aggaaacaat tgcattaact 
aaggaaaaac ttcgtccttt tgttctgaaa 



ttggaaggga tagcggttat tattcctagt 3 60 
tttgactact atatgggtgt tgatcagtat 420 
cagcttatac ccctaaggac agttcgttat 480 
atgttgcatg ccattcgaga taacctcagt 540 
caagagttag cagattgtca acaactaacc 600 
ctacagaatt tagaagactt agtgtcagga 660 
ttgcctcatt ttaatagaga taagtctgcc 720 
ggtttgaagg aaaaaggatt gtggaaagag 780 
gtcattattt ctgacatggg ctttgatgat 840 
tttggacgca gtaaaggcta ttatatggga 900 
gtggcttatg ctctgaacat tacagggatt 960 
cgctttttaa acaaagaacg ttatagcatg 1020 
taccgttcag aatttctacg gtatgtccga 1080 
attgtgacct tttcaacctt tggccaggct 1140 
ccagaatacg aactgactaa tctcactaaa 1200 
gtctatgaaa agtcaatctc ttttaggcag 12 60 
gcttttgcca ttgccaagcg tatcgaagga 1320 
ggtattgtga tgagtgatga tgccttgacc 1380 
atgatgatca cccagtatga tgctcatgcg 1440 
tttttggggt taagaaattt gacctttgtt 1500 
tacgggtgtc agattgatat tacagccatt 1560 
tttgctaaag gggataccaa gggaattttc 1620 
ttaaaacgga ttaagccaca acgttttgaa 1680 
ccaggggcaa gtgactatac cactaatttc 1740 
gatttgattg atcctgtgat tgctcccatt 1800 
caagaacaag ttatgcagat tgcacaggtt 1860 
ttgttaaggc gtgccatgtc taaaaaaaat 1920 
tttattgctt ctgctaagca cctagggaga 1980 
cggatggaaa aatttgcagg ttatggtttt 2040 
ttagcttttc aattggctta tttcaaagcc 2100 
atgaattatt ctagcagtga ctatatcaca 2160 
caagttacca ttaatagtat tccttacact 2220 
gggctgaaaa atattaaggg gttgccaagg 2280 
ccatttaata gcgtagagga ttttctcact 2340 
ttccttgagc ctctgataaa aataggtctg 2400 
attctggaca atttggatgg tttactggta 2460 
gattcttcct ttagttgggt agatacgaaa 2520 
gaacaggaga tcgttggagt tggcatgagc 25 80 
agtacccaaa cttttactcc tatttcacag 2640 
attcaaatag atagcattag gatcattaga 2700 
ttaagtgtga atgacactaa gaaaaagctc 2760 
atttataaag accaattaaa agaaggagaa 2 82 0 
agagaccatc gactgcagat ggtgtgtcag 2880 
tggttattag ttgaaaacca tcagtttgat 2940 
ccaggaacga ctccagttgt tattcactat 3000 
aagattcagg ttcatgtaac agagaattta 3060 
acggtttttc ga 3102 
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<210> 20 
<211> 1034 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 20 

Met Phe Ala Gin Leu Asp Thr Lye Thr Val Tyr Ser Phe Met Asp Ser 



Leu He Asp Leu Asn His Tyr Phe Glu Arg Ala Lys Gin Phe Gly Tyr 
20 25 30 

His Thr He Gly He Met Asp Lys Asp Asn Leu Tyr Gly Ala Tyr His 
35 40 45 

Phe He Lys Gly Cys Gin Lys Asn Gly Leu Gin Pro Val Leu Gly Leu 
50 55 GO 

Glu He Glu He Leu Tyr Gin Glu Arg Gin Val Leu Leu Asn Leu He 

65 70 75 80 

Ala Gin Asn Thr Gin Gly Tyr His Gin Leu Leu Lys He Ser Thr Ala 



Lys Met Ser Gly Lys Leu His Met Asp Tyr Phe Cys Gin His Leu Glu 
100 105 110 

Gly He Ala Val He He Pro Ser Lys Gly Trp Ser Asp Thr Leu Val 
115 120 125 

Val Pro Phe Asp Tyr Tyr Met Gly Val Asp Gin Tyr Thr Asp Leu Ser 
130 135 140 

His Met Asp Ser Lys Arg Gin Leu He Pro Leu Arg Thr Val Arg Tyr 
145 150 155 160 

Phe Ala Gin Asp Asp Met Glu Thr Leu His Met Leu His Ala He Arg 
165 170 175 

Asp Asn Leu Ser Leu Ala Glu Thr Pro Val Val Glu Ser Asp Gin Glu 
180 185 190 

Leu Ala Asp Cys Gin Gin Leu Thr Ala Phe Tyr Gin Thr His Cys Pro 
195 200 205 

Gin Ala Leu Gin Asn Leu Glu Asp Leu Val Ser Gly He Tyr Tyr Asp 
210 215 220 
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Phe Asp Thr Asn Leu Lys Leu Pro His Phe Asn Arg Asp Lys Ser Ala 
225 230 235 240 

Lys Gin Glu Leu Gin Asp Leu Thr Glu Ala Gly Leu Lys Glu Lys Gly 
245 250 255 

Leu Trp Lys Glu Pro Tyr Gin Ser Arg Leu Leu His Glu Leu Val lie 
260 265 270 

lie Ser Asp Met Gly Phe Asp Asp Tyr Phe Leu lie Val Trp Asp Leu 
275 280 285 

Leu Arg Phe Gly Arg Ser Lys Gly Tyr Tyr Met Gly Met Gly Arg Gly 
290 295 300 

Ser Ala Ala Gly Ser Leu Val Ala Tyr Ala Leu Asn lie Thr Gly lie 
305 310 315 320 

Asp Pro Val Gin His Asp Leu Leu Phe Glu Arg Phe Leu Asn Lys Glu 
325 330 335 

Arg Tyr Ser Met Pro Asp He Asp He Asp Leu Pro Asp lie Tyr Arg 
340 345 350 

Ser Glu Phe Leu Arg Tyr Val Arg Asn Arg Tyr Gly Ser Asp His Ser 
355 360 365 

Ala Gin He Val Thr Phe Ser Thr Phe Gly Pro Lys Gin Ala He Arg 
370 375 380 

Asp Val Phe Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Thr Asn Leu 
385 390 395 400 

Thr Lys Lys lie Gly Phe Lys Asp Ser Leu Ala Thr Val Tyr Glu Lys 
405 410 415 

Ser He Ser Phe Arg Gin Val He Asn Ser Arg Thr Glu Phe Gin Lys 
420 425 430 

Ala Phe Ala He Ala Lys Arg He Glu Gly Asn Pro Arg Gin Thr Ser 
435 440 445 

He His Ala Ala Gly He Val Met Ser Asp Asp Ala Leu Thr Asn His 
450 455 460 

He Pro Leu Lys Ser Gly Asp Asp Met Met He Thr Gin Tyr Asp Ala 
465 470 475 480 
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Hie Ala Val Glu Ala Asn Gly Leu Leu Lys Met Asp Phe Leu Gly Leu 
485 490 495 

Arg Asn Leu Thr Phe Val Gin Lys Met Gin Glu Lys Val Ala Lys Asp 
500 505 510 

Tyr Gly Cys Gin lie Asp He Thr Ala He Asp Leu Glu Asp Pro Gin 
515 520 525 

Thr Leu Ala Leu Phe Ala Lys Gly Asp Thr Lys Gly He Phe Gin Phe 
530 535 540 

Glu Gin Asn Gly Ala He Asn Leu Leu Lys Arg He Lys Pro Gin Arg 
545 550 555 560 

Phe Glu Glu He Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser 
565 570 575 

Asp Tyr Thr Thr Asn Phe He Lys Arg Arg Glu Gly Gin Glu Lys He 
580 585 590 

Asp Leu He A6p Pro Val He Ala Pro He Leu Glu Pro Thr Tyr Gly 
595 600 605 

He Met Leu Tyr Gin Glu Gin Val Met Gin He Ala Gin Val Tyr Ala 
610 615 620 

Gly Phe Thr Leu Gly Lys Ala Asp Leu Leu Arg Arg Ala Met Ser Lys 
625 630 635 640 

Lys Asn Leu Gin Glu Met Gin Lys Met Glu Glu Asp Phe He Ala Ser 
645 650 655 

Ala Lys His Leu Gly Arg Ala Glu Glu Thr Ala Arg Gly Leu Phe Lys 
660 665 670 

Arg Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Phe 
675 680 685 

Ala Tyr Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys Ala His Tyr 
690 695 700 

Pro Ala Val Phe Tyr Asp He Met Met Asn Tyr Ser Ser Ser Asp Tyr 
705 710 715 720 

He Thr Asp Ala Leu Glu Ser Asp Phe Gin Val Ala Gin Val Thr He 
725 730 735 
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Asn Ser He Pro Tyr Thr Asp Lys He Glu Ala Ser Lys He Tyr Met 
740 745 750 

Gly Leu Lye Asn He Lys Gly Leu Pro Arg Asp Phe Ala Tyr Trp He 
755 760 765 

He Glu Gin Arg Pro Phe Asn Ser Val Glu Asp Phe Leu Thr Arg Thr 
770 775 780 

Pro Glu Lys Tyr Gin Lys Lys Val Phe Leu Glu Pro Leu He Lys He 
785 790 795 800 

Gly Leu Phe Asp Cys Phe Glu Pro Asn Arg Lys Lys He Leu Asp Asn 
805 810 815 

Leu Asp Gly Leu Leu Val Phe Val Asn Glu Leu Gly Ser Leu Phe Ser 
820 825 830 

Asp Ser Ser Phe Ser Trp Val Asp Thr Lys Asp Tyr Ser Val Thr Glu 
835 840 845 

Lys Tyr Ser Leu Glu Gin Glu He Val Gly Val Gly Met Ser Lys His 
850 855 860 

Pro Leu He Asp He Ala Glu Lys Ser Thr Gin Thr Phe Thr Pro He 
865 870 875 880 

Ser Gin Leu Val Lys Glu Ser Glu Ala Val Val Leu He Gin He Asp 
885 890 895 

Ser He Arg He He Arg Thr Lys Thr Ser Gly Gin Gin Met Ala Phe 
900 905 910 

Leu Ser Val Asn Asp Thr Lys Lys Lys Leu Asp Val Thr Leu Phe Pro 
915 920 925 

Gin Glu Tyr Ala He Tyr Lye Asp Gin Leu Lys Glu Gly Glu Phe Tyr 
930 935 940 

Tyr Leu Lys Gly Arg He Lys Glu Arg Asp His Arg Leu Gin Met Val 
945 950 955 960 

Cys Gin Gin Val Gin Met Ala He Ser Gin Lys Tyr Trp Leu Leu Val 
965 970 975 

Glu Asn His Gin Phe Asp Ser Gin He Ser Glu He Leu Gly Ala Phe 
980 985 990 
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Pro Gly Thr Thr Pro Val Val He His Tyr Gin Lys Asn Lys Glu Thr 
995 1000 1005 

He Ala Leu Thr Lye He Gin Val Thr Glu Aan Leu Lys Glu Lys Leu 
1010 1015 1020 

Arg Pro Phe Val Leu Lys Thr Val Phe Arg 
1025 1030 



<210> 21 
<211> 1038 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 21 

atgattgcga tagaaaagat tgaaaaactg 
gtcacaggag atgacattgg tcagtatagc 
gcttttgata aggatgattt ggcctattct 
gatgcagaaa tggatctagt gagcctaccc 
gaccatttgt tagatatcac gaccaataaa 
gcctttgaag cctatttaga aaatccctta 
ggtaaattgg atagtaagag acggcttgtt 
gaagccaacc ctctgaaaga agcagagcta 
ctgggtttag gtttcgagag tggtgccttt 
tttagtcaaa tcatgaaaaa catggccttt 
agcctaactg atattgagca agccattcct 
actagacttg tcctaggagg taaaattgat 
ttatctggag aagatgacat taaattaatc 
ttgcagctga ctattcttgc tagagatgta 
tcagatattc ttgggcggcg ggttaatcct 
aggaccttat ctcttgcctt tctaacagga 
cagataaaaa caggacttta tgagaagagt 
atgactcact ctcaaaaa 



agtaaagaaa atttgggtct tataaccctt 60 
cagttgaaat cccgcttaat ggagcagatt 120 
tactttgata tgtctgaggc cgcttatcag 180 
ttctttgctg agcagaaggt ggttattttt 240 
aaaagtttct taaaagaaaa agacctaaag 3 00 
gagactactc gactaattat ctttgctcca 360 
aagcttttga aacgtgatgc ccttgtttta 420 
agaacttatt ttcaaaaata cagtcatcaa 480 
gaccaattac ttttgaaatc aaacgatgat 540 
ttaaaagcct ataaaaaaac gggaaatatt 600 
aaaagtttac aagataatat tttcgatctg 660 
gcggctagag atttgattca tgatttacgg 72 0 
gctatcatgc taggccaatt tcgcttattt 780 
aaaaacgagc aacaactagt gattagttta 840 
taccaggtca agtatgcgtt aaaggattct 900 
gcggtgaaaa ccttgattga gacagattac 960 
tatctagttg atattgctct cttaaaaatc 1020 
1038 



<210> 22 
<211> 346 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 22 

Met lie Ala He Glu Lys He Glu Lys Leu Ser Lys Glu Asn Leu Gly 
15 10 15 

Leu He Thr Leu Val Thr Gly Asp Asp He Gly Gin Tyr Ser Gin Leu 
20 25 30 
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Lys Ser Arg Leu Met Glu Gin He Ala Phe Asp Lys Asp Asp Leu Ala 
35 40 45 

Tyr Ser Tyr Phe Asp Met Ser Glu Ala Ala Tyr Gin Asp Ala Glu Met 
50 55 60 

Asp Leu Val Ser Leu Pro Phe Phe Ala Glu Gin Lys Val Val He Phe 
65 70 75 80 

Asp His Leu Leu Asp He Thr Thr Asn Lys Lys Ser Phe Leu Lys Glu 
85 90 95 

Lys Asp Leu Lys Ala Phe Glu Ala Tyr Leu Glu Asn Pro Leu Glu Thr 
100 105 110 

Thr Arg Leu He He Phe Ala Pro Gly Lys Leu Asp Ser Lys Arg Arg 
115 120 125 

Leu Val Lys Leu Leu Lys Arg Asp Ala Leu Val Leu Glu Ala Asn Pro 
130 135 140 

Leu Lys Glu Ala Glu Leu Arg Thr Tyr Phe Gin Lys Tyr Ser His Gin 
145 150 155 160 

Leu Gly Leu Gly Phe Glu Ser Gly Ala Phe Asp Gin Leu Leu Leu Lys 
165 170 175 

Ser Asn Asp Asp Phe Ser Gin He Met Lys Asn Met Ala Phe Leu Lys 
180 185 190 

Ala Tyr Lys Lys Thr Gly Asn He Ser Leu Thr Asp He Glu Gin Ala 
195 200 205 

He Pro Lys Ser Leu Gin Asp Asn He Phe Asp Leu Thr Arg Leu Val 
210 215 220 

Leu Gly Gly Lys He Asp Ala Ala Arg Asp Leu He His Asp Leu Arg 
225 230 235 240 

Leu Ser Gly Glu Asp Asp He Lys Leu He Ala He Met Leu Gly Gin 
245 250 255 

Phe Arg Leu Phe Leu Gin Leu Thr He Leu Ala Arg Asp Val Lys Asn 
260 265 270 

Glu Gin Gin Leu Val He Ser Leu Ser Asp He Leu Gly Arg Arg Val 
275 280 285 
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Asn Pro Tyr Gin Val Lys Tyr Ala 

290 295 

Leu Ala Phe Leu Thr Gly Ala Val 
305 310 

Gin lie Lys Thr Gly Leu Tyr Glu 
325 

Leu Leu Lys lie Met Thr His Ser 
340 



Leu Lys Asp Ser Arg Thr Leu Ser 
300 

Lys Thr Leu lie Glu Thr Asp Tyr 
315 320 

Lys Ser Tyr Leu Val Asp lie Ala 
330 335 

Gin Lys 
345 



<210> 23 
<211> 873 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 23 

atggatttag cgcaaaaagc tcctaacgtt 
gaccgtctga atcatgctta tcttttttcg 
tttttagcta aggtcatctt ttgtgaacag 
cgctcttgtc aattgattga acaaggagat 
gggcaagtga ttaaaacgga tgtggtcaaa 
tatgaaaaca aacgacaagt ttttattatc 
gctaatagct tgctaaaata cattgaggag 
accaatgatg ataacaaagt gcttccgacc 
cctaaaaacg aagcctatct ttaccaattg 
gctaagctag tagccaaact tgccacaaac 
agcaagcttt tagaactgat aactcaagca 
cagttgcagg catatttagc gttgaaccgt 
caagatttag ttttgaccct tttgaccttg 
ttgacacaat tggaggctgt ctatcaggct 
caaaacacat tagaatatat ggtgatgtca 



tatcaagctt ttcagacaat tttaaagaaa 60 
ggtgattttg ctaatgaaga aatggctctt 120 
aaaaaggatc agacgccctg cgggcattgt 180 
tttgccgatg tgacggtatt ggaaccaaca 240 
gaaatgatgg ctaacttttc tcagacagga 3 00 
aaagattgtg acaaaatgca tatcaatgcc 360 
cctcagggag aagcttacat atttttattg 420 
attaaaagtc ggacacaggt ttttcagttt 480 
gcacaagaaa agggattatt aaaccatcag 540 
accagtcatc tagaacgtct gttgcaaacg 600 
gagcgttttg tatctatttg gctgaaagat 660 
ctggtacagt tagcaactga aaaagaagaa 72 0 
ctcttggcaa gagagcgtgc gcaaacgcct 780 
aggctcatgt ggcagagcaa tgttaatttt 840 
gaa 87 3 



<210> 24 
<211> 291 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 24 

Met Asp Leu Ala Gin Lys Ala Pro Asn Val Tyr Gin Ala Phe Gin Thr 
15 10 15 

lie Leu Lys Lys Asp Arg Leu Asn His Ala Tyr Leu Phe Ser Gly Asp 
20 25 30 
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Phe Ala Aen Glu Glu Met Ala Leu Phe Leu Ala Lys Val lie Phe Cys 
35 40 45 

Glu Gin Lya Lys Asp Gin Thr Pro CyB Gly His Cys Arg Ser Cys Gin 



Leu lie Glu Gin Gly Asp Phe Ala Asp Val Thr Val Leu Glu Pro Thr 
65 70 75 80 

Gly Gin Val lie Lys Thr Asp Val Val Lys Glu Met Met Ala Asn Phe 



Ser Gin Thr Gly Tyr Glu Asn Lys Arg Gin Val Phe He He Lys Asp 
100 105 110 

Cys Asp Lys Met His He Asn Ala Ala Asn Ser Leu Leu Lys Tyr He 
115 120 125 

Glu Glu Pro Gin Gly Glu Ala Tyr He Phe Leu Leu Thr Asn Asp Asp 
130 135 140 

Asn Lys Val Leu Pro Thr He Ly6 Ser Arg Thr Gin Val Phe Gin Phe 
145 150 155 160 

Pro Lys Asn Glu Ala Tyr Leu Tyr Gin Leu Ala Gin Glu LyB Gly Leu 
165 170 175 

Leu Asn His Gin Ala Lys Leu Val Ala LyB Leu Ala Thr Asn Thr Ser 
180 185 190 

His Leu Glu Arg Leu Leu Gin Thr Ser Lys Leu Leu Glu Leu He Thr 
195 200 205 

Gin Ala Glu Arg Phe Val Ser He Trp Leu Lys Asp Gin Leu Gin Ala 
210 215 220 

Tyr Leu Ala Leu Asn Arg Leu Val Gin Leu Ala Thr Glu Lys Glu Glu 
225 230 235 240 

Gin Asp Leu Val Leu Thr Leu Leu Thr Leu Leu Leu Ala Arg Glu Arg 
245 250 255 

Ala Gin Thr Pro Leu Thr Gin Leu Glu Ala Val Tyr Gin Ala Arg Leu 
260 265 270 

Met Trp Gin Ser Asn Val Asn Phe Gin Asn Thr Leu Glu Tyr Met Val 
275 280 285 
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Met Ser Glu 
290 



<210> 25 
<211> 1665 
<212> OKA 

<213> Streptococcus pyogenes 
<400> 25 

atgtatcaag ctctttatcg gaaataccgg agccaaacgt ttgacgaaat ggtgggacaa 60 
tcggttattt ccacaacttt aaagcaggca gttgaatctg gcaagattag ccatgcttat 120 
cttttttcag gtcctagagg gactgggaaa acaagtgcgg caaagatttt tgcaaaggcc 180 
atgaattgtc ctaaccaagt cgatggtgaa ccctgtaatc aatgcgatat ttgccgagat 240 
atcacgaatg gaagcttgga agatgtgatt gaaattgatg ctgcctcgaa taatggtgtt 300 
gatgaaattc gtgacattcg agacaaatca acctatgcgc caagtcgtgc gacttacaag 360 
gtttatatta ttgatgaggt tcacatgtta tcaacagggg cttttaatgc gcttttgaaa 420 
actttggaag aaccgacaga atgttgtctt tatcttggca acaacggaat gcataaaatt 480 
ccagccacta ttttatctcg tgtgcaacgc tttgaattca aagctattaa gcaaaaagct 540 
attcgagagc atttagcctg ggttttggac aaagaaggta ttgcctatga ggtggatgct 600 
ttaaatctca ttgcaaggcg agcagaagga ggcatgcgtg atgctttatc tattttagat 660 
caggctttga gcttgtcacc agataatcag gtcgccattg caattgccga agaaattaca 720 
ggttctattt ccatacttgc tctgggtgac tatgttcgat atgtctccca agaacaggct 780 
acgcaagctc tggcagcctt agagaccatt tatgatagtg ggaagagcat gagccgcttt 840 
gcgacagatt tattgaccta tctgcgtgat ttattggtgg ttaaagctgg cggcgacaat 900 
caacgtcagt cagctgtttt tgataccaat ttgtctctct cgatagatcg tatattccaa 960 
atgataacag ttgttactag tcatctccct gaaatcaaaa agggaaccca tcctcggatt 1020 
tatgccgaaa tgatgactat ccaattagct cagaaagagc agattttgtc ccaagtaaac 1080 
ttgtcaggag agttaatctc agagattgaa acgctcaaaa atgagttggc acaacttaaa 1140 
caacaattgt cgcagctcca atcgcgtcct gattcactgg caagatctga taaaacgaaa 1200 
cctaaaacca caagctacag ggttgatcgg gttaccattt tgaaaatcat ggaagaaacg 1260 
gttcgaaata gccaacaatc tcgacaatat ctagatgctc taaaaaatgc ttggaatgaa 132 0 
attctagata acatttctgc ccaagacaga gccttattga tgggctctga gcctgtctta 13 80 
gcaaatagtg agaatgcgat tttggctttc gaggctgcct ttaatgcaga acaagtcatg 1440 
agccgaaata atcttaatga tatgtttggt aacattatga gtaaagctgc tggtttttct 1500 
cccaatattc tggcagtacc aaggacagat tttcagcata ttcgtaagga atttgctcag 1560 
caaatgaaat cgcaaaaaga cagtgttcaa gaagaacaag aagtagcgct tgatattcca 1620 
gaagggtttg attttttgct cgataaaata aatactattg acgac 1665 



<210> 26 
<211> 555 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 26 

Met Tyr Gin Ala Leu Tyr Arg Lys Tyr Arg Ser Gin Thr Phe Asp Glu 
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15 10 15 

Met Val Gly Gin Ser Val lie Ser Thr Thr Leu Lys Gin Ala Val Glu 
20 25 30 

Ser Gly Lye lie Ser Hie Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

Gly Lys Thr Ser Ala Ala Lye lie Phe Ala Lye Ala Met Asn Cys Pro 
50 55 60 

Asn Gin Val Asp Gly Glu Pro Cys Asn Gin Cys Asp lie Cys Arg Asp 
65 70 75 80 

lie Thr Asn Gly Ser Leu Glu Asp Val lie Glu He Asp Ala Ala Ser 
85 90 95 

Asn Asn Gly Val Asp Glu lie Arg Asp He Arg Asp Lys Ser Thr Tyr 
100 105 110 

Ala Pro Ser Arg Ala Thr Tyr Lys Val Tyr He He Asp Glu Val His 
115 120 125 

Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 
130 135 140 

Pro Thr Glu Asn Val Phe He Leu Ala Thr Thr Glu Leu His Lys He 
145 150 155 160 

Pro Ala Thr He Leu Ser Arg Val Gin Arg Phe Glu Phe Lys Ala He 
165 170 175 

Lys Gin Lys Ala He Arg Glu His Leu Ala Trp Val Leu Asp Lys Glu 
180 185 190 

Gly He Ala Tyr Glu Val Asp Ala Leu Asn Leu He Ala Arg Arg Ala 
195 200 205 

Glu Gly Gly Met Arg Asp Ala Leu Ser He Leu Asp Gin Ala Leu Ser 
210 215 220 

Leu Ser Pro Asp Asn Gin Val Ala He Ala He Ala Glu Glu He Thr 
225 230 235 240 

Gly Ser He Ser He Leu Ala Leu Gly Asp Tyr Val Arg Tyr Val Ser 
245 250 255 

Gin Glu Gin Ala Thr Gin Ala Leu Ala Ala Leu Glu Thr He Tyr Asp 
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Ser Gly Lys Ser Met Ser Arg Phe Ala Thr Asp Leu Leu Thr Tyr Leu 
275 280 285 

Arg Asp Leu Leu Val Val Lys Ala Gly Gly Asp Asn Gin Arg Gin Ser 
290 295 300 

Ala Val Phe Asp Thr Asn Leu Ser Leu Ser lie Asp Arg lie Phe Gin 
305 310 315 320 

Met He Thr Val Val Thr Ser His Leu Pro Glu He Lys Lys Gly Thr 
325 330 335 

His Pro Arg He Tyr Ala Glu Met Met Thr He Gin Leu Ala Gin Lys 
340 345 350 

Glu Gin He Leu Ser Gin Val Asn Leu Ser Gly Glu Leu He Ser Glu 
355 360 365 

He Glu Thr Leu Lys Asn Glu Leu Ala Gin Leu Lys Gin Gin Leu Ser 
370 375 380 

Gin Leu Gin Ser Arg Pro Asp Ser Leu Ala Arg Ser Asp Lys Thr Lys 
385 390 395 400 

Pro Lys Thr Thr Ser Tyr Arg Val Asp Arg Val Thr He Leu Lys He 
405 410 415 

Met Glu Glu Thr Val Arg Asn Ser Gin Gin Ser Arg Gin Tyr Leu Asp 
420 425 430 

Ala Leu Lys Asn Ala Trp Asn Glu He Leu Asp Asn He Ser Ala Gin 
435 440 445 

Asp Arg Ala Leu Leu Met Gly Ser Glu Pro Val Leu Ala Asn Ser Glu 
450 455 460 

Aen Ala He Leu Ala Phe Glu Ala Ala Phe Asn Ala Glu Gin Val Met 
465 470 475 480 

Ser Arg Asn Asn Leu Asn Asp Met Phe Gly Asn lie Met Ser Lys Ala 
485 490 495 

Ala Gly Phe Ser Pro Asn He Leu Ala Val Pro Arg Thr Asp Phe Gin 
500 505 510 



His He Arg Lys Glu Phe Ala Gin Gin Met Lys Ser Gin Lys Asp Ser 
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515 520 525 

Val Gin Glu Glu Gin Glu Val Ala Leu Asp lie Pro Glu Gly Phe Asp 
530 535 540 

Phe Leu Leu Asp Lys lie Asn Thr lie Asp Asp 
545 550 555 



<210> 27 
<211> 1134 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 27 

atgattcaat tttcaattaa tcgcacatta tttattcatg ctttaaatac aactaaacgt 60 
gctattagca ctaaaaatgc cattcctatt ctttcatcaa taaaaattga agtcacttct 120 
acaggagtaa ctttaacagg gtctaacggt caaatatcaa ttgaaaacac tattcctgta 180 
agtaatgaaa atgctggttt gctaattacc tctccaggag ctattttatt agaagctagt 240 
ttttttatta atattatttc aagtttgcca gatattagta taaatgttaa agaaattgaa 300 
caacaccaag ttgttttaac cagtggtaaa tcagagatta ccttaaaagg aaaagatgtt 36 0 
gaccagtatc ctcgtctaca agaagtatca acagaaaatc ctttgatttt aaaaacaaaa 420 
ttattgaagt ctattattgc tgaaacagct tttgcagcca gtttacaaga aagtcgtcct 480 
attttaacag gagttcatat tgtattaagt aatcataaag attttaaagc agtagcgact 540 
gactctcatc gtatgagcca acgtttaatc actttggaca atacttcagc agatttgatg 600 
gtagttcttc caagtaaatc tttgagagaa ttttcagcag tatttacaga tgatattgag 660 
accgttgagg tatttttctc accaagccaa atcttgttca gaagtgaaca catttctttt 720 
tatacacgcc tcttagaagg aaattatccc gatacagacc gtttattaat gacagaattt 780 
gagacggagg ttgttttcaa tacccaatcc cttcgccacg ctatggaacg tgccttcttg 840 
atttctaatg ctactcaaaa tggtactgtt aagcttgaga ttactcaaaa tcatatttca 900 
gctcatgtta actcacctga ggttggtaag gtaaacgagg atttagatat tgttagtcag 960 
tctggtagtg atttaactat cagcttcaat ccaacttacc ttattgagtc tttaaaagct 1020 
attaaaagtg aaacagtaaa aattcatttc ttatcaccag ttcgaccatt caccctaaca 1080 
ccaggcgatg aggaagaaag ttttatccaa ttaattacac cagtacgaac aaac 1134 

<210> 28 
<211> 378 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 28 

Met lie Gin Phe Ser lie Asn Arg Thr Leu Phe He His Ala Leu Asn 
15 10 15 

Thr Thr Lye Arg Ala He Ser Thr Lys Asn Ala He Pro He Leu Ser 

20 25 30 
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Ser lie Lys lie Glu Val Thr Ser Thr Gly Val Thr Leu Thr Gly Ser 
35 40 45 

Asn Gly Gin lie Ser He Glu Asn Thr He Pro Val Ser Asn Glu Asn 
50 55 60 

Ala Gly Leu Leu He Thr Ser Pro Gly Ala He Leu Leu Glu Ala Ser 
65 70 75 80 

Phe Phe He Asn He He Ser Ser Leu Pro Asp He Ser lie Asn Val 
85 90 95 

Lys Glu He Glu Gin His Gin Val Val Leu Thr Ser Gly Lys Ser Glu 
100 105 110 

He Thr Leu Lys Gly Lys Asp Val Asp Gin Tyr Pro Arg Leu Gin Glu 
115 120 125 

Val Ser Thr Glu Asn Pro Leu He Leu Lys Thr Lys Leu Leu Lys Ser 
130 135 140 

He lie Ala Glu Thr Ala Phe Ala Ala Ser Leu Gin Glu Ser Arg Pro 
145 150 155 160 

He Leu Thr Gly Val His He Val Leu Ser Asn His Lys Asp Phe Lys 
165 170 175 

Ala Val Ala Thr Asp Ser His Arg Met Ser Gin Arg Leu He Thr Leu 
180 185 190 

Asp Asn Thr Ser Ala Asp Leu Met Val Val Leu Pro Ser Lys Ser Leu 
195 200 205 

Arg Glu Phe Ser Ala Val Phe Thr Asp Asp He Glu Thr Val Glu Val 
210 215 220 

Phe Phe Ser Pro Ser Gin He Leu Phe Arg Ser Glu His He Ser Phe 
225 230 235 240 

Tyr Thr Arg Leu Leu Glu Gly Asn Tyr Pro Asp Thr Asp Arg Leu Leu 
245 250 255 

Met Thr Glu Phe Glu Thr Glu Val Val Phe Asn Thr Gin Ser Leu Arg 
260 265 270 

His Ala Met Glu Arg Ala Phe Leu He Ser Asn Ala Thr Gin Asn Gly 
275 280 285 
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Thr Val Lys Leu Glu lie Thr Gin Asn His lie Ser Ala Hie Val Asn 
290 295 300 

Ser Pro Glu Val Gly Lys Val Asn Glu Asp Leu Asp lie Val Ser Gin 
305 310 315 320 

Ser Gly Ser Asp Leu Thr He Ser Phe Asn Pro Thr Tyr Leu He Glu 
325 330 335 

Ser Leu Lys Ala He Lys Ser Glu Thr Val Lys He His Phe Leu Ser 
340 345 350 

Pro Val Arg Pro Phe Thr Leu Thr Pro Gly Asp Glu Glu Glu Ser Phe 
355 360 365 

He Gin Leu He Thr Pro Val Arg Thr Asn 
370 375 



<210> 29 
<211> 492 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 29 

atgattaata atgtagtact agttggtcgc atgaccaagg atgcagaact tcgttacaca 60 
ccaagtcaag tagctgtggc taccttcaca cttgctgtta accgtacctt taaaagccaa 120 
aatggtgaac gcgaggcaga tttcattaac tgtgtgatct ggcgtcaacc ggctgaaaat 180 
ttagcgaact gggctaaaaa aggtgctttg atcggagtta cgggtcgtat tcatacacgt 240 
aactacgaaa accaacaagg acaacgtgtc tatgtaacag aagttgttgc agataatttc 3 00 
caaatgttgg aaagtcgtgc tacacgtgaa ggtggctcaa ctggctcatt taatggtggt 360 
tttaacaata acacttcatc atcaaacagt tactcagcgc ctgcacaaca aacgcctaac 420 
tttggaagag atgatagccc atttgggaac tcaaacccga tggatatctc agatgacgat 480 
cttccattct ag 492 



<210> 30 
<211> 163 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 30 

Met He Asn Asn Val Val Leu Val Gly Arg Met Thr Lys Asp Ala Glu 
15 10 15 

Leu Arg Tyr Thr Pro Ser Gin Val Ala Val Ala Thr Phe Thr Leu Ala 
20 25 30 
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Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 
35 40 45 

lie Asn Cys Val lie Trp Arg Gin Pro Ala Glu Asn Leu Ala Asn Trp 
50 55 60 

Ala Lys Lys Gly Ala Leu lie Gly Val Thr Gly Arg He Gin Thr Arg 
65 70 75 80 

Asn Tyr Glu Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 
85 90 95 

Ala Asp Asn Phe Gin Met Leu Glu Ser Arg Ala Thr Arg Glu Gly Gly 
100 105 110 

Ser Thr Gly Ser Phe Asn Gly Gly Phe Asn Asn Asn Thr Ser Ser Ser 
115 120 125 

Asn Ser Tyr Ser Ala Pro Ala Gin Gin Thr Pro Asn Phe Gly Arg Asp 
130 135 140 

Asp Ser Pro Phe Gly Asn Ser Asn Pro Met Asp He Ser Asp Asp Asp 
145 150 155 160 

Leu Pro Phe 



<210> 31 
<211> 1815 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 31 

atgggatttt tatggggagg tgacgatttg 

aaaaatagcg ttaatattgt cgatgtcatt 

cggcattacc tcgggctttg cccatttcat 

gaagacagac aattttttca ctgctttggc 

attgaggaat accgccaagt ccccttctta 

ggtatgtcgc ttaatatacc gccaagtcag 

aatcacgctt tgatgacact tcatgaggat 

accactacca ttggtcaaga agctaggaag 

ttaattgagc atttcaatat tggtttagcc 

ctttctaaaa aatacgagga aggtcaattg 

caatccaata ccatttacga cgcctttcga 

cgagggcata ttattgcctt ttcaggacgt 

caggcaaagt ataaaaattc aagaggaaca 

catctggaca aggcaaggcc tgttattgcc 



gcaattgaca aagaaatgat ttcccaagta 60 
ggagaagtgg tcaaactttc ccgatcaggg 120 
aaggaaaaga caccctcttt taatgttgtt 180 
tgtggaaaat caggggatgt ttttaaattt 240 
gaaagtgttc agattattgc cgataagact 300 
gcagtacttg ctagccaaca caagcaccct 360 
gctgctaaat tttaccatgc agttttgatg 420 
tacctttacc agagaggctt ggatgaccaa 480 
ccagatgagt cagattatct ttatcaagct 540 
gttgcttcag gattgtttca cttgtccgat 600 
aatcgtatca tgtttccctt atcagatgac 660 
atctggacgg cagctgatat ggaaaagaga 720 
gttcttttta acaaatctta tgaattgtat 780 
aaaacccatg aagtgtttct aatggaaggg 84 0 
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tttatggacg tgattgccgc ttaccgttcc 
acggcattga ctcaagaaca tgtcaatcac 
atttatgatg gtgacgatgc tggacaacat 
gattttgttg tcgaaattgt cagaatcccc 
cggcattccc cagaagcatt tgcagatttg 
ttttttattg attacctaaa acctactaat 
gtggagaaaa tggcaccatt gattgctcaa 
attaacaaga ttgctgattt gttgccaaac 
aatgcattaa ggattcaaga taggcaaaaa 
aatcttgtga ccttaccaat gccaaaaagt 
ctcatgcatc ggctcttaca tcatgactat 
ttttattttg atacctctac cttagaatta 
attacatctt atgatttgtc agagatgtca 
ttagaagaaa accttcccaa agaagtagct 
cgtgccaaac ttttagcaga gcgcgatctt 
agtaacaaag gcgatcatca agcggctcta 
cgaaaaatgg aatag 



ggctatgaaa atgctgttgc ttcaatgggg 900 
cttaagcaag tcactaaaaa agttgttttg 960 
gctattgcaa aatcactaga attgcttaaa 1020 
aataaaatgg atcctgacga atttgtacaa 1080 
cttaagcagt cacggatcag tagtgttgaa 1140 
gtagacaatt tgcaatcaca aattgtttat 1200 
tcaccatcca tcacagctca acattcgtat 1260 
tttgactatt ttcaagtaga acaatcagta 1320 
catcaaggtc aaatagctca agccgtcagc 1380 
ttgacagcta ttgctaagac agaaagtcat 1440 
ttattaaatg aatttcgaca tcgtgatgat 1500 
ctttatcaac ggctgaagca acaaggacac 1560 
gaggaagtta accgtgctta ttacaatgtt 1620 
cttggtgaga ttgatgatat tttatccaaa 1680 
cacaaacaag ggaaaaaagt tagagaatct 1740 
gaagtactag aacattttat tgcgcagaaa 1800 
1815 



<210> 32 
<211> 600 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 32 

Met Gly Phe Leu Trp Gly Gly Asp Asp Leu Ala lie Asp Lys Glu Met 
15 10 15 

lie Ser Gin Val Lys Asn Ser Val Asn lie Val Asp Val He Gly Glu 
20 25 30 

Val Val Lys Leu Ser Arg Ser Gly Arg His Tyr Leu Gly Leu Cys Pro 
35 40 45 

Phe His Lys Glu Lys Thr Pro Ser Phe Asn Val Val Glu Asp Arg Gin 
50 55 60 

Phe Phe His Cys Phe Gly Cys Gly Lys Ser Gly Asp Val Phe Lys Phe 
65 70 75 80 

He Glu Glu Tyr Arg Gin Val Pro Phe Leu Glu Ser Val Gin He lie 
85 90 95 

Ala Asp Lys Thr Gly Met Ser Leu Asn lie Pro Pro Ser Gin Ala Val 
100 105 110 

Leu Ala Ser Gin His Lye His Pro Asn His Ala Leu Met Thr Leu His 
115 120 125 
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Glu Asp Ala Ala Lys Phe Tyr Hie Ala Val Leu Met Thr Thr Thr lie 
130 135 140 

Gly Gin Glu Ala Arg Lys Tyr Leu Tyr Gin Arg Gly Leu Asp Asp Gin 
145 150 155 160 

Leu lie Glu His Phe Asn lie Gly Leu Ala Pro Asp Glu Ser Asp Tyr 
165 170 175 

Leu Tyr Gin Ala Leu Ser Lys Lys Tyr Glu Glu Gly Gin Leu Val Ala 
180 185 190 

Ser Gly Leu Phe His Leu Ser Asp Gin Ser Asn Thr lie Tyr Asp Ala 
195 200 205 

Phe Arg Asn Arg He Met Phe Pro Leu Ser Asp Asp Arg Gly His He 
210 215 220 

lie Ala Phe Ser Gly Arg lie Trp Thr Ala Ala Asp Met Glu Lys Arg 
225 230 235 240 

Gin Ala Lys Tyr Lys Asn Ser Arg Gly Thr Val Leu Phe Asn Lys Ser 
245 250 255 

Tyr Glu Leu Tyr His Leu Asp Lys Ala Arg Pro Val lie Ala Lys Thr 
260 265 270 

His Glu Val Phe Leu Met Glu Gly Phe Met Asp Val He Ala Ala Tyr 
275 280 285 

Arg Ser Gly Tyr Glu Asn Ala Val Ala Ser Met Gly Thr Ala Leu Thr 
290 295 300 

Gin Glu His Val Asn His Leu Lys Gin Val Thr Lys Lys Val Val Leu 
305 310 315 320 

He Tyr Asp Gly Asp Asp Ala Gly Gin His Ala He Ala Lys Ser Leu 
325 330 335 

Glu Leu Leu Lys Asp Phe Val Val Glu He Val Arg He Pro Asn Lye 
340 345 350 

Met Asp Pro Asp Glu Phe Val Gin Arg His Ser Pro Glu Ala Phe Ala 
355 360 365 

Asp Leu Leu Lys Gin Ser Arg He Ser Ser Val Glu Phe Phe He Asp 
370 375 380 
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Tyr Leu Lys Pro Thr Asn Val Asp Asn Leu Gin Ser Gin lie Val Tyr 
385 390 395 400 

Val Glu Lys Met Ala Pro Leu He Ala Gin Ser Pro Ser He Thr Ala 
405 410 415 

Gin His Ser Tyr He Asn Lys He Ala Asp Leu Leu Pro Asn Phe Asp 
420 425 430 

Tyr Phe Gin Val Glu Gin Ser Val Asn Ala Leu Arg He Gin Asp Arg 
435 440 445 

Gin Lys His Gin Gly Gin He Ala Gin Ala Val Ser Asn Leu Val Thr 
450 455 460 

Leu Pro Met Pro Lys Ser Leu Thr Ala He Ala Lys Thr Glu Ser His 
465 470 475 480 

Leu Met His Arg Leu Leu His His Asp Tyr Leu Leu Asn Glu Phe Arg 
485 490 495 

His Arg Asp Asp Phe Tyr Phe Asp Thr Ser Thr Leu Glu Leu Leu Tyr 
500 505 510 

Gin Arg Leu Lys Gin Gin Gly His He Thr Ser Tyr Asp Leu Ser Glu 
515 520 525 

Met Ser Glu Glu Val Asn Arg Ala Tyr Tyr Asn Val Leu Glu Glu Asn 
530 535 540 

Leu Pro Lys Glu Val Ala Leu Gly Glu He Asp Asp lie Leu Ser Lys 
545 550 555 560 

Arg Ala Lys Leu Leu Ala Glu Arg Asp Leu His Lys Gin Gly Lys Lys 
565 570 575 

Val Arg Glu Ser Ser Asn Lys Gly Asp His Gin Ala Ala Leu Glu Val 
580 585 590 

Leu Glu His Phe He Ala Gin Lys 
595 600 



<210> 33 

<211> 1368 

<212> DNA 

<213> Streptococcus pyogenes 
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<400> 33 

atgaggttgc ctgaagtagc tgaattacga 
tctgttcttg ggtcaatctt tatctcacct 
agtccagacg atttttataa gtacgctcat 
agcgatcgta atgatgccat tgatgcaacc 
gatctgcaaa gtattggtgg cttatcctat 
agtgctaatg cagaatatta tgctaaaatt 
attgctaggt tgacagaatc tgtcaaccta 
gttatcgctg gagttgagag agctttaatt 
tttcgcaaaa tttcagatgt gctaaaagtt 
cagacttcaa atgttacagg tttaccaact 
ggtttacacc cagatcaatt agttatttta 
tttgttctta atattgcgca aaatgtgggg 
tctttggaaa tgggtgctga aagtttagta 
gattcgcaca gtttaagaac agggcaactc 
gctcagggag ctttggcaga agcaccgatt 
actgaaatcc gcgcaagatc acggaaattg 
attgtaattg actacttaca gttgattaca 
gtttcagata tttcaagaca gcttaaaatc 
gccctaagtc agctttctcg tggcgttgag 
gatattcgtg aatcaggatc tattgagcag 
gacgattatt accgtaaaga atgtgatgat 
gaagttatcc tcgagaaaaa tagagctggg 
aaagaataca acaaattctc aagtatagcc 



gttcaacccc aagatttact agcagagcaa GO 
gataagctga ttgcagtgag agaatttatc 120 
aaaattatct ttcgggcaat gattaccctc 180 
actataagaa caatcctaga tgatcaagat 240 
attgttgaac tagttaatag tgtcccaact 3 00 
gtagctgaga aagctatgtt gcgtgatatt 360 
gcttatgatg aaattttaaa accagaagag 420 
gaactcaatg aacatagtaa tcgtagtggg 480 
aattacgagg ctttagaagc acgttctaag 540 
ggttttagag accttgacaa gattacaaca 600 
gctgctcggc cagcagtggg gaagactgcc 6 60 
actaagcaaa aaaagactgt tgctattttt 720 
gatcgtatgc ttgcagcaga aggaatggtt 7 80 
acagatcagg attggaataa tgtaacaatt 840 
tatattgacg atacgcccgg gattaaaatt 900 
tctcaagaag tggatggtgg tttaggtctc 9 60 
ggaactaaac ccgaaaatcg tcagcaagag 1020 
ctagctaaag aattgaaagt accagttatt 1080 
caaaggcaag ataaacgacc agttttatca 1140 
gatgccgata ttgtagcctt cttataccgg 1200 
gctgaagaag ctgttgaaga taacacaatt 1260 
gcgcgtggaa cagtcaaact gatgttccaa 1320 
cagtttgaag aaagataa 1368 



<210> 34 
<211> 455 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 34 

Met Arg Leu Pro Glu Val Ala Glu 
1 5 

Leu Ala Glu Gin Ser Val Leu Gly 
20 

Leu lie Ala Val Arg Glu Phe He 

35 40 

Ala His Lys He He Phe Arg Ala 
50 55 

Asp Ala He Asp Ala Thr Thr He 
65 70 



Leu Arg Val Gin Pro Gin Asp Leu 
10 15 

Ser He Phe He Ser Pro Asp Lys 
25 30 

Ser Pro Asp Asp Phe Tyr Lys Tyr 
45 

Met He Thr Leu Ser Asp Arg Asn 
60 

Arg Thr He Leu Asp Asp Gin Asp 
75 80 
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Asp Leu Gin Ser He Gly Gly Leu Ser Tyr He Val Glu Leu Val Asn 



Ser Val Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys He Val Ala 
100 105 110 

Glu Lys Ala Met Leu Arg Asp He He Ala Arg Leu Thr Glu Ser Val 
115 120 125 

Asn Leu Ala Tyr Asp Glu He Leu Lys Pro Glu Glu Val He Ala Gly 
130 135 140 

Val Glu Arg Ala Gin Gly Ala Leu Ala Glu Ala Pro lie Tyr He Asp 
145 150 155 160 

Asp Thr Pro Gly He Lys He Ala Leu He Glu Leu Asn Glu His Ser 
165 170 175 

Asn Arg Ser Gly Phe Arg Lys He Ser Asp Val Leu Lys Val Asn Tyr 
180 185 190 

Glu Ala Leu Glu Ala Arg Ser Lye Gin Thr Ser Asn Val Thr Gly Leu 
195 200 205 

Pro Thr Gly Phe Arg Asp Leu Asp Lys He Thr Thr Gly Leu His Pro 
210 215 220 

Asp Gin Leu Val He Leu Ala Ala Arg Pro Ala Val Gly Lys Thr Ala 
225 230 235 240 

Phe Val Leu Asn He Ala Gin Asn Val Gly Thr Lys Gin Lys Lys Thr 
245 250 255 

Val Ala He Phe Ser Leu Glu Met Gly Ala Glu Ser Leu Val Asp Arg 
260 265 270 

Met Leu Ala Ala Glu Gly Met Val Asp Ser His Ser Leu Arg Thr Gly 
275 280 285 

Gin Leu Thr Asp Gin Asp Trp Asn Asn Val Thr He Thr Glu He Arg 
290 295 300 

Ala Arg Ser Arg Lys Leu Ser Gin Glu Val Asp Gly Gly Leu Gly Leu 
305 310 315 320 

He Val He Asp Tyr Leu Gin Leu He Thr Gly Thr Lys Pro Glu Asn 
325 330 335 
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Arg Gin Gin Glu Val Ser Asp He Ser Arg Gin Leu Lys He Leu Ala 
340 345 350 

Lys Glu Leu Lys Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
355 360 365 

Val Glu Gin Arg Gin Asp Lye Arg Pro Val Leu Ser Asp He Arg Glu 
370 375 380 

Ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Arg Lys Glu Cys Asp Asp Ala Glu Glu Ala Val Glu 
405 410 415 

Asp Asn Thr He Glu Val He Leu Glu Lys Asn Arg Ala Gly Ala Arg 
420 425 430 

Gly Thr Val Lys Leu Met Phe Gin Lys Glu Tyr Asn Lys Phe Ser Ser 
435 440 445 

He Ala Gin Phe Glu Glu Arg 
450 455 



<210> 35 
<211> 29 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

ggtggtaatt gtcttgcata tgacagagc 29 

<210> 36 
<211> 31 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

agcgattaag tggattgccg ggttgtgatg c 31 
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<210> 37 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

agcatcacaa cccggcaatc cacttaatcg c 31 



<210> 38 
<211> 29 
<212> DKA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

gactacgcca tgggcattaa ataaatacc 29 



<210> 39 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

gaagatgcat ataaacgtgc aagacctagt 30 



<210> 40 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 40 

gtctgacgca cgaattgtaa agtaagatgc atag 34 
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<210> 41 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

cgactggaag gagttttaac atatgatgga attcac 



<210> 42 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

ttatatggat ccttagtaag ttctgattgg 



<210> 43 
<211> 15 
<212> PRT 

<213> Escherichia coli 



<400> 43 

Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro 
1 5 10 15 



<210> 44 
<211> 15 
<212> PRT 

<213> Escherichia coli 



<400> 44 

Lys Phe Ala Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr 
15 10 15 



<210> 45 
<211> 44 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 45 

cttctttttg aaagatttct aaataaagaa cgttattcaa tgcc 44 



<210> 46 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Deecription of Artificial Sequence: primer 
<400> 46 

ataagctgca gcatgacttt tattaaaacc ataacctgca aattt 45 



<210> 47 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 47 

agttaaaaat gccatatttt gacgtgtttt agttctaat 39 



<210> 48 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 48 

cttgcaaaag cggttgctaa agatgttgga cgaattatgg gg 42 

<210> 49 
<211> 10 
<212> PRT 
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<213> Escherichia coli 
<400> 49 

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 
1 5 10 



<210> 50 
<211> 10 
<212> PRT 

<213> Escherichia coli 
<400> 50 

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 
15 10 



<210> 51 
<211> 38 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 51 

cgcggatccc atgcatattt attttcaggt ccaagagg 3 8 

<210> 52 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 52 

ccggaattct ggtggttctt ctaatgtttt taataatgc 39 

<210> 53 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 53 

tttgtaaagg cattacgcag gggactaatt cagatgtg 3 8 

<210> 54 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description o£ Artificial Sequence: primer 
<400> 54 

tatgacattc attacaaggt tctccatcag tgc 33 



<210> 55 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 55 

gagcactgat gaacttagaa ttagatatg 29 

<210> 56 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 56 

gatactcagt atctttctca gatgttttat tc 32 

<210> 57 
<211> 14 
<212> PRT 

<213> Escherichia coli 
<400> 57 

Aep Leu lie lie Val Ala Ala Arg Pro Ser Met Gly Lys Thr 
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15 10 



<210> 58 
<211> 15 
<212> PRT 

<213> Escherichia coli 
<400> 58 

Glu lie He He Gly Lys Gin Arg Asn Gly Pro He Gly Thr Val 
15 10 15 

<210> 59 
<211> 41 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 59 

gaccttataa ttgtagctgc acgtccttct atgggaaaaa c 41 

<210> 60 

<211> 48 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 60 

aacattatta agtcagcatc ttgttctatt gatccagatt caacgaag 48 

<210> 61 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 61 

gatttgtagt tctggtaatg ttgactcaaa ccgcttaaga accgg 45 
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<210> 62 
<211> 48 
<212> DNA 

<213> Artificial Sequence 



c220> 

<223> Description of Artificial Sequence: primer 
<400> 62 

atacgtgtgg ttaactgatc agcaacccat ctctagtgag aaaatacc 48 



<210> 63 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 63 

cgttttaatg catgcttaga aacgatatca g 31 



<210> 64 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 64 

cattgctaag caacgttacg gtccaacagg c 31 



<210> 65 
<211> 69 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 65 

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60 
gatgaacag 69 
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<210> 66 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 66 

aattttaaag gatccgtgta taatattcta attttcccg 39 

<210> 67 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 67 

gggagtttgt aatccatgga tgaacagc 28 

<210> 68 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 68 

ctgaacacct attaccctag gcatctaact cacaccc 37 

<210> 69 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 69 

ggagcagatt gcttttgata catatgattg gcctattc 38 
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<210> 70 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 70 

ttgtctccgc atcaaactgg gatccaagag catcatacgc gtatgg 46 

<210> 71 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 71 

gcctaggata agggagggta catatggatt tagcgc 36 

<210> 72 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 72 

cgggcaagtc ttttgacaag cttcggatcc ccataacgaa ttcc 44 

<210> 73 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 73 

ggagttaaaa acatatgtat caagctcttt ate 33 
67 



WO (11/09164 



PCT/L'S(IO/20666 



<210> 74 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 74 

cgtgggtaag ggcaaaacgg atcccttatg tatttcag 38 

<210> 75 
<211> 34 
<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 75 

ggagttcata tgattcaatt ttcaaattaa tcgc 34 

<210> 76 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 76 

tatcagctcc tggatccagt accttccatt gattagcc 38 

<210> 77 
<211> 74 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 77 

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60 
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<210> 78 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 78 

cggtgtctct atctaaatga ctcatttggg atcctcgctt tatacggtat gtcacag 57 

<210> 79 
<211> 43 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 79 

gggaacaaga taaccaagga ggaacccatg gttgctcaac ttg 43 

<210> 80 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 80 

cgaatagcag cgttcatacc aggatcctcg ccgccactgg 40 

<210> 81 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 81 
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accattttgg cttttaaagg tacggttaac agcaagtgtg aaggtagcc 49 

<210> 82 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 82 

gaacgcgagg cagatttcat taactgtgtg atctggcg 38 

<210> 83 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 83 

tttaaaagag ggtagcatat gattaataat gtagtactag ttggtcgc 48 

<210> 84 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 84 

tttaaattta aacctaggtt caatccattc tgactagaat ggaagatcgt c 51 
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